1 What's it all about?



A beaker full of water, a diamond, a wire, or a litre of helium are all complex dynamical 
systems with well defined bulk properties such as volume V, pressure P, temperature 
T, thermal conductivity k, electrical conductivity $\sigma$, etc. We know experimentally that
these properties are connected to one another by relations - for example an equation of 
state that links V, P and T - and we would like to connect these with our understand- 
ing of the dynamics of the systems. That is, we wish to start from the concept that 
a diamond is composed of carbon atoms arranged in a tetrahedrally bonded lattice and derive
expressions for a diamond's coefficient of thermal expansion, and its compressibility, 
thermal conductivity etc, in terms of say its temperature and pressure. Statistical 
mechanics is the branch of knowledge that enables us to achieve this goal. We shall 
restrict our attention to the sub-branch of equilibrium statistical mechanics, in which 
we assume that the system under study is in 'thermal equilibrium'. In practice this 
means that we are excluding problems in which different parts of the system are at 
different temperatures or pressures, or, worse still, cannot be characterized by a tem- 
perature at all. The restriction to thermal equilibrium is not a very severe one from the 
point of view of terrestrial experiments, but it is very limiting in astrophysics because 
significant departures from thermal equilibrium are common in the Universe at large. 

The system under study is described mathematically by specifying its Hamiltonian 
H. This is the usual quantum operator associated with energy. What will be unfamiliar 
about H is its complexity: it typically describes a system that has $\sim 10^{23}$ dynamical
variables. For example, the simplest non-trivial model of a diamond would involve the
positions $\mathbf{x}_i$ and momenta $\mathbf{p}_i$ of $N \sim 10^{23}$ carbon atoms, each of which is connected
by springs to its four nearest neighbours. Hence the Hamiltonian would be of the form

$$H = \sum_{i=1}^{N} \left\{ \frac{p_i^2}{2m} + \frac{1}{4} \sum_{j=1}^{4} K \left( |\mathbf{x}_i - \mathbf{x}_{n_i(j)}| - a \right)^2 \right\}. \qquad (1.1)$$

Here $m$ is the mass of a carbon atom, $K$ is the spring constant, $a$ is the natural length
of each C-C bond, $n_i(j)$ is a function that returns the
index of the $j$th neighbour of atom $i$ and the factor 1/4 compensates for the fact
that each spring is included twice in the sums over atoms. This vast Hamiltonian
operates on wavefunctions that may be taken to depend on the N vector variables $\mathbf{x}_i$:
$\psi(\mathbf{x}_1, \ldots, \mathbf{x}_N)$. The eigenstates $|\alpha\rangle$ of this vast operator are the states in which the
crystal as a whole has well defined energy. We are unlikely to have explicit expressions
for the wavefunctions of these stationary states, but we shall pretend that we do.

The thermodynamic properties of the system turn out to depend strongly on the 
distribution over energy of the stationary states $|\alpha\rangle$. The energy levels of a finite
system such as a crystal will be discrete, but the spacing between adjacent energy
levels becomes smaller and smaller as we consider larger and larger crystals, and in the
thermodynamic limit of crystals that are arbitrarily large, the energy levels form
a continuum. In this limit the system's thermodynamics depends on the density of
states $g(E)$, which is such that the number of quantum states with energies in the
range $(E, E + dE)$ is

$$dN = g(E)\,dE. \qquad (1.2)$$

Which quantum state is our system in? We do not know, and we are not likely 
to find out, because exquisite accuracy in the measurement of the energy would be 
required to pick one state from the continuum of possible ones. By the uncertainty 
principle, a measurement of E to sufficient accuracy would require an indefinite period 
of time, and is entirely impracticable. Moreover, it is likely that many energy levels are 
highly degenerate, so measurement of the energy would have to be followed by further 
measurements of other observables before we could determine $|\alpha\rangle$ unambiguously. So
we accept that we do not know which state the system is in and settle for the probability
$p_\alpha$ that, if we were to make the necessary measurements, the system would be found in
the state $|\alpha\rangle$.

We identify the internal energy U of thermodynamics with the expectation value 
of the energy that is associated with the probability distribution $\{p_\alpha\}$:¹

$$U = \sum_\alpha p_\alpha E_\alpha. \qquad (1.3)$$

The energy eigenvalues $E_\alpha$ will depend on thermodynamic variables such as the volume:
if we compress a beaker of water, we increase each $E_\alpha$ by some amount $(\partial E_\alpha/\partial V)\,dV$
that we could in principle determine from quantum mechanics. Similarly, the energy
levels of a wire depend on the wire's length $l$. If the compression or stretching is slow,
we know from quantum mechanics that the system will remain in the $\alpha$th eigenstate
of $H(t)$ as the system is deformed. Consequently, the probabilities $p_\alpha$ do not change
during a slow, or 'adiabatic', deformation. By equating the change in the beaker's
energy to the work $-P\,dV$ that we do during the compression, we find that the pressure
is given by

$$P = -\sum_\alpha p_\alpha \frac{\partial E_\alpha}{\partial V}. \qquad (1.4)$$

Similarly, the tension in a wire is given by

$$\tau = \sum_\alpha p_\alpha \frac{\partial E_\alpha}{\partial l}. \qquad (1.5)$$


We'll want to apply our results to magnetic systems. The energy levels $E_\alpha$ of a
magnetic system depend on the magnetic field B, so B plays a role analogous to that
of the volume V of a fluid. The magnetization M plays a role analogous to P in that
it represents the response of the system to an imposed field B at a given temperature.
If we increase the field by dB, the system's energy changes by

$$dU = -\mathbf{M}\cdot d\mathbf{B}, \qquad (1.6)$$

where the minus sign arises because the energy is lowest when M is parallel to B
(magnets align with the imposed field). Comparing this last expression for dU with
the derivative of (1.3) for an adiabatic change, we find that

$$\mathbf{M} = -\sum_\alpha p_\alpha \frac{\partial E_\alpha}{\partial \mathbf{B}}. \qquad (1.7)$$

¹ In these notes an object inside curly brackets, such as $\{p_j\}$, means 'the set of $p_j$s'.

In more complex systems, the $E_\alpha$ may depend on several variables. For example,
the energy levels of a diamond crystal depend not only on the crystal's volume, but
also on its shape: you have to do work to shear it at constant volume. By analogy with
equation (1.4) there is an analogue of pressure associated with each of the variables
upon which the energy levels depend. Below we shall focus on the case in which the
energy levels depend on only one parameter, which we shall call V, and use the symbol
P for the associated variable. Our formulae can be readily adapted to other physical
situations by replacing (P, V) with $(-\tau, l)$, (M, B), or a sum of such pairs of variables.

We shall see below that the system's temperature and entropy depend on the
probabilities $p_\alpha$, and we now ask what the values of these should be.



2 The principle of maximum entropy 

Many practical problems, both in physics and in many other branches of life, such as 
book-making, image-processing and business, can be reduced to the determination of 
probabilities given certain information. It turns out that there is just one consistent
way of doing this: to maximize the 'entropy' of the probability distribution. The American
physicist J.W. Gibbs was the first to determine probability distributions in this
way, but he offered no satisfactory explanation of why it was the correct procedure, 
perhaps because illness and death forced him to cut short his book on statistical me- 
chanics. Hence it was left to Shannon, who was interested in communication theory, 
to give a clear account of entropy maximization in 1948. Many physicists have still 
not grasped that entropy maximization lies at the heart of statistical mechanics and 
is a general principle of logical deduction, more basic than any physical application. 

Consider some event that can have $n$ different outcomes $x_1, \ldots, x_n$. The event
might be the throwing of a die, and $x$ might be the number obtained ($x_1 = 1$, etc).
We want to assign probabilities $p_1, \ldots, p_n$ to the possible outcomes that reflect our
knowledge about the event, and nothing more. If we know nothing about the die, we
will argue that every outcome is equally likely, and set $p_i = 1/6$ for all $i$. But we might
know more; for example, we might know that the average value obtained over a large number of
throws is 3.667, which differs significantly from the value 3.5 expected of an unbiased
die. What $p_i$ should we then choose? If $p_1 = 1/3$, $p_4 = 1/3$, $p_6 = 1/3$ and every other
$p_i = 0$ we would have $\langle x \rangle = 11/3$ as required. But it doesn't seem likely that the slight
difference between the mean value of $x$ and 3.5 arises because the die never lands on 2
or 3 or 5; it is much more probable that 4, 5 and 6 come up just a bit more often than
1, 2 or 3. We need a rational way of choosing $p_i$ that reflect the knowledge that we
do have ($\langle x \rangle = 3.667$) without making unjustified assumptions (such as that 2 won't
come up).

To achieve this goal we look for a measure $S(p_1, \ldots, p_n)$ of the uncertainty, or
'missing information', associated with a probability distribution and maximize this by
varying the $p_i$ subject to whatever data we have (in the case of the die, $\langle x \rangle = 3.667$).
We require that S have the following properties:

• S must be a continuous, symmetric function of the $p_i$;

• S should be largest when every outcome is equally likely, i.e., when $p_i = 1/n$ for
all $i$. We define

$$S\left(\tfrac{1}{n}, \ldots, \tfrac{1}{n}\right) = S_n \qquad (2.1)$$

and require that $S_{n+1} > S_n$ (more possibilities implies more uncertainty).

• S shall be consistent in the sense that it yields the same uncertainty when there
are different ways of enumerating the possible outcomes of the event.

To grasp the essence of the last requirement, consider an experiment with three possible
outcomes $x_1$, $x_2$ and $x_3$ to which we assign probabilities $p_1$, $p_2$ and $p_3$, yielding an
uncertainty $S(p_1, p_2, p_3)$. We could group the last two outcomes together and assign a
probability $p_{23} = p_2 + p_3$ to getting $x_{23}$ (i.e., $x_2$ or $x_3$), giving uncertainty $S(p_1, p_{23})$.
To this uncertainty we have to add that associated with resolving the outcome $x_{23}$ into
either $x_2$ or $x_3$. The probability that we will have to resolve this uncertainty is $p_{23}$,
and the probability of getting $x_2$ given that we have $x_{23}$ is $p_2/p_{23}$, so we argue that

$$S(p_1, p_2, p_3) = S(p_1, p_{23}) + p_{23}\, S\!\left(\frac{p_2}{p_{23}}, \frac{p_3}{p_{23}}\right). \qquad (2.2)$$

This equation is readily generalized: we have $n$ possible outcomes $x_1, \ldots, x_n$ with
probabilities $p_1, \ldots, p_n$. We gather the outcomes into $r$ groups and let $y_1$ be the
outcome in which one of $x_1, \ldots, x_{k_1}$ was obtained, $y_2$ the outcome in which one of
$x_{k_1+1}, \ldots, x_{k_2}$ was obtained etc, and let $w_i$ denote the probability of the outcome $y_i$.
Then since the probability that we get $x_1$ given that we have already obtained $y_1$ is
$p_1/w_1$, we have

$$S(p_1, \ldots, p_n) = S(w_1, \ldots, w_r) + w_1 S\!\left(\frac{p_1}{w_1}, \ldots, \frac{p_{k_1}}{w_1}\right) + \cdots + w_r S\!\left(\frac{p_{k_{r-1}+1}}{w_r}, \ldots, \frac{p_n}{w_r}\right). \qquad (2.3)$$

Since S is a continuous function of its arguments, it suffices to evaluate it for
rational values of the arguments. So we assume that $p_i = n_i/N$, where $\sum_i n_i = N$ by
the requirement that the probabilities sum to unity. Consider a system in which there
are N equally likely outcomes, and from these form $n$ groups, with $n_i$ possibilities in
the $i$th group. Then the probability of the $i$th group is $p_i$ and the probability of getting
any possibility in the $i$th group, given that the $i$th group has come up, is $1/n_i$. Hence
applying (2.3) to the whole system we find

$$S\!\left(\frac{1}{N}, \ldots, \frac{1}{N}\right) = S(p_1, \ldots, p_n) + \sum_{i=1}^{n} p_i\, S\!\left(\frac{1}{n_i}, \ldots, \frac{1}{n_i}\right) \qquad (2.4)$$

or with (2.1)

$$S(p_1, \ldots, p_n) = S_N - \sum_{i=1}^{n} p_i S_{n_i} \qquad \Big(N = \sum_{i=1}^{n} n_i\Big). \qquad (2.5)$$

This equation relates S evaluated on a general argument list to the values that S takes
when all its arguments are equal. Setting all the $n_i = m$, so that $N = nm$ and $p_i = 1/n$,
we obtain a relation that involves only the $S_n$:

$$S_n = S_{nm} - S_m. \qquad (2.6)$$
This functional equation is solved by $S_n = K \ln n$, where K is an arbitrary constant. In
fact, one may show (Appendix A) that this is the only monotone solution of (2.6). We
set $K = k_B$ (Boltzmann's constant) and conclude from (2.5) that (up to a multiplicative
constant) the unique measure of uncertainty is

$$\begin{aligned}
S(p_1, \ldots, p_n) &= k_B \ln N - k_B \sum_{i=1}^{n} p_i \ln n_i \\
&= -k_B \sum_{i} p_i (\ln n_i - \ln N) \\
&= -k_B \sum_{i} p_i \ln p_i. \qquad (2.7)
\end{aligned}$$

A vast number of practical problems can be solved by applying the principle that 
the probabilities of outcomes should be chosen so that this function S has the largest 
value that is compatible with whatever prior information we may have about the 
probabilities. 
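
The properties demanded of S above can be checked numerically. The following is a minimal sketch, not part of the original notes: the function name `entropy`, the choice k = 1 and the sample probabilities are illustrative assumptions. It evaluates the entropy of a uniform six-outcome distribution, of an assignment that puts probability 1/3 on each of the faces 1, 4 and 6 (one assignment with mean 11/3 that never shows 2, 3 or 5), and tests the grouping rule (2.2) on an arbitrary three-outcome case.

```python
import math

def entropy(probs, k=1.0):
    """Return S = -k * sum(p ln p), treating 0 ln 0 as 0."""
    return -k * sum(p * math.log(p) for p in probs if p > 0.0)

# Uniform distribution over six outcomes: S should equal ln 6.
uniform6 = [1.0 / 6.0] * 6

# Hypothetical 'lumpy' assignment: faces 1, 4 and 6 only.
lumpy = [1 / 3, 0.0, 0.0, 1 / 3, 0.0, 1 / 3]

# Grouping rule (2.2) for an arbitrary (assumed) three-outcome case.
p1, p2, p3 = 0.5, 0.3, 0.2
p23 = p2 + p3
lhs = entropy([p1, p2, p3])
rhs = entropy([p1, p23]) + p23 * entropy([p2 / p23, p3 / p23])

print("uniform:", entropy(uniform6), "ln 6:", math.log(6))
print("lumpy:  ", entropy(lumpy), "ln 3:", math.log(3))
print("grouping rule:", lhs, rhs)
```

The lumpy assignment has the lower entropy of the two, which is the quantitative sense in which it assumes more than the data justify.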

3 The canonical distribution 

We now consider the situation in which there are just two constraints on the probabilities
$p_\alpha$:

$$\sum_\alpha p_\alpha = 1 ; \qquad \sum_\alpha p_\alpha E_\alpha = U. \qquad (3.1)$$


The first of these constraints simply states that on measurement of the energy, the
system would be found to be in some state $|\alpha\rangle$, and the second condition states that
the system has internal energy U. We use the method of Lagrange multipliers to
maximize

$$S/k_B = -\sum_\alpha p_\alpha \ln p_\alpha \qquad (3.2)$$


subject to these constraints. Thus we multiply the left side of each constraint equation
by a Lagrange multiplier ($\lambda$ or $\beta$) and subtract it from $S/k_B$ before varying each $p_\alpha$.
The Lagrange multipliers allow us to equate the coefficient of $dp_\alpha$ to zero for every $\alpha$.
In this way we find

$$\lambda + \beta E_\alpha = -(\ln p_\alpha + 1). \qquad (3.3)$$

Solving for $p_\alpha$ we have

$$p_\alpha = \frac{e^{-\beta E_\alpha}}{Z} \quad \text{where} \quad Z \equiv e^{1+\lambda}. \qquad (3.4)$$

Inserting this solution into the first of the constraints (3.1), we see that

$$Z = \sum_\alpha e^{-\beta E_\alpha}. \qquad (3.5)$$

Z is called the partition function - we shall find that it plays a central role in
statistical mechanics.
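
As a concrete illustration of the canonical distribution, here is a minimal sketch that computes Z and the probabilities for a small, made-up list of energy levels. The function name `canonical`, the levels and the value of beta are assumptions chosen for illustration; energies are shifted by their minimum before exponentiating, a common numerical precaution that does not change the probabilities.

```python
import math

def canonical(energies, beta):
    """Return (Z, probabilities) with p_a = exp(-beta * E_a) / Z."""
    e_min = min(energies)  # shift to avoid overflow in exp for large |beta * E|
    weights = [math.exp(-beta * (e - e_min)) for e in energies]
    z_shifted = sum(weights)
    probs = [w / z_shifted for w in weights]
    z = z_shifted * math.exp(-beta * e_min)  # undo the shift for Z itself
    return z, probs

levels = [0.0, 1.0, 1.0, 2.5]  # hypothetical energy eigenvalues (one degenerate pair)
beta = 0.7                      # hypothetical inverse temperature
Z, p = canonical(levels, beta)
U = sum(pa * ea for pa, ea in zip(p, levels))

print("Z =", Z)
print("p =", p)
print("U =", U)
```

Degenerate levels receive equal probabilities, and ln p + beta E takes the same value (-ln Z) for every state, as the stationarity condition requires.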



Exercise (1):

Use the principle of maximum entropy to show that the biased die described above
should be assigned probabilities

$$p_j = \frac{1 - e^{-\beta}}{1 - e^{-6\beta}}\, e^{-\beta (j-1)},$$

where $\beta$ is the solution of

$$3.667 = 1 - \frac{6}{e^{6\beta} - 1} + \frac{1}{e^{\beta} - 1}.$$

[The probabilities $p_1$ to $p_6$ are 0.144 0.152 0.161 0.171 0.181 0.191.]
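
The quoted probabilities can be reproduced numerically. The sketch below is one possible approach, not the method of the notes: it assumes weights proportional to exp(-beta j) for face j and finds beta by bisection on the mean. The function names and the bracketing interval are illustrative choices.

```python
import math

FACES = range(1, 7)

def die_probs(beta):
    """Maximum-entropy probabilities p_j proportional to exp(-beta * j)."""
    weights = [math.exp(-beta * j) for j in FACES]
    z = sum(weights)
    return [w / z for w in weights]

def mean_face(beta):
    return sum(j * p for j, p in zip(FACES, die_probs(beta)))

def solve_beta(target, lo=-5.0, hi=5.0, tol=1e-12):
    """Bisection; the mean decreases monotonically as beta increases."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mean_face(mid) > target:
            lo = mid   # mean still too high, so beta must be larger
        else:
            hi = mid
    return 0.5 * (lo + hi)

beta = solve_beta(3.667)
probs = die_probs(beta)
print("beta =", beta)
print("probabilities:", [round(p, 3) for p in probs])
```

Because the mean exceeds 3.5, the multiplier beta comes out negative, so higher faces are slightly favoured.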

If our system is large enough for its energy levels to be treated as a continuum, 
we may write 

$$Z = \int dE\, g(E)\, e^{-\beta E}. \qquad (3.6)$$



We now hypothesize that the quantity S that is defined by equation (3.2) is the 
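thermodynamic entropy. When we use (3.4) to eliminate $\ln p_\alpha$ from (3.2), we get

$$S = k_B \sum_\alpha p_\alpha (\beta E_\alpha + \ln Z) = k_B \beta U + k_B \ln Z \quad\Rightarrow\quad U = \frac{S}{k_B \beta} - \frac{\ln Z}{\beta}. \qquad (3.7)$$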

In thermodynamics the definition of the Helmholtz free energy F reads 

U = TS + F. (3.8) 
If equations (3.7) and (3.8) are to be the same, we must have 

$$\beta = \frac{1}{k_B T} \quad \text{and} \quad F = -k_B T \ln Z. \qquad (3.9)$$

These two equations are of fundamental importance. For future reference we note that 
the second equation can be written 

$$\ln Z = -\beta F, \qquad (3.10)$$

so (3.4) yields

$$\ln p_\alpha = \beta (F - E_\alpha). \qquad (3.11)$$

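Consider the effect on S of changing the probabilities $p_\alpha$. Bearing in mind the
first of equations (3.1), which requires $\sum_\alpha dp_\alpha = 0$, we have from (3.2) and (3.11) that

$$dS = -k_B \sum_\alpha dp_\alpha \ln p_\alpha = -k_B \beta \sum_\alpha dp_\alpha (F - E_\alpha) = k_B \beta \sum_\alpha E_\alpha\, dp_\alpha. \qquad (3.12)$$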
Consider finally the change in U under an arbitrary variation in V and the $p_\alpha$:

$$\begin{aligned}
dU &= \sum_\alpha \left( E_\alpha\, dp_\alpha + p_\alpha \frac{\partial E_\alpha}{\partial V}\, dV \right) \\
&= T\,dS - P\,dV, \qquad (3.13)
\end{aligned}$$

where we have used (1.4) and (3.12). Thus our hypothesis that S is the thermody- 
namic entropy is vindicated in that it enables us to recover the basic equations of 
thermodynamics. 

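In practical calculations the most efficient procedure is to calculate Z first, which
immediately yields F from (3.9). Then we have S and P from

$$S = -\left(\frac{\partial F}{\partial T}\right)_V ; \qquad P = -\left(\frac{\partial F}{\partial V}\right)_T. \qquad (3.14)$$

U then follows immediately from (3.8).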

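3.0.1 $\ln Z$ as a generating function We have just seen that knowledge of Z enables
us to calculate many of the thermal averages of thermodynamics without explicitly
recovering the $p_\alpha$. Actually, by differentiating $\ln Z$ w.r.t. $-\beta$ we can recover other
useful quantities. The first derivative is

$$-\frac{\partial \ln Z}{\partial \beta} = \frac{1}{Z} \sum_\alpha E_\alpha e^{-\beta E_\alpha} = \langle E \rangle = U. \qquad (3.15)$$

Differentiating again we have

$$\begin{aligned}
\frac{\partial^2 \ln Z}{\partial \beta^2} &= \frac{1}{Z}\sum_\alpha E_\alpha^2 e^{-\beta E_\alpha} - \frac{1}{Z^2}\Big(\sum_\alpha E_\alpha e^{-\beta E_\alpha}\Big)^2 \\
&= \langle E^2\rangle - \langle E\rangle^2 = (\Delta E)^2. \qquad (3.16)
\end{aligned}$$

Thus the first derivative of $\ln Z$ is the expectation value of the energy, which we have
identified with the internal energy U, while the second derivative is the mean-square
variation $(\Delta E)^2$ of the energy. The third derivative would give an interesting third
moment of E (try it!) and so on.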
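
The generating-function property is easy to test numerically. The sketch below is an illustration under assumed inputs (a made-up five-level spectrum, an arbitrary beta and a finite-difference step h): it compares central-difference derivatives of ln Z with respect to beta against the mean and variance of the energy computed directly from the canonical probabilities.

```python
import math

def ln_z(energies, beta):
    return math.log(sum(math.exp(-beta * e) for e in energies))

def energy_moments(energies, beta):
    """Mean and variance of E under p_a = exp(-beta * E_a) / Z."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    mean = sum(w * e for w, e in zip(weights, energies)) / z
    mean_sq = sum(w * e * e for w, e in zip(weights, energies)) / z
    return mean, mean_sq - mean ** 2

levels = [0.0, 0.4, 1.1, 1.7, 3.0]  # hypothetical spectrum
beta, h = 1.3, 1e-3                  # hypothetical beta and step size

# Central finite differences for the first and second derivatives of ln Z.
d1 = (ln_z(levels, beta + h) - ln_z(levels, beta - h)) / (2 * h)
d2 = (ln_z(levels, beta + h) - 2 * ln_z(levels, beta) + ln_z(levels, beta - h)) / h ** 2

U, var_E = energy_moments(levels, beta)
print("-d lnZ/d beta   =", -d1, " <E>      =", U)
print("d2 lnZ/d beta^2 =", d2, " (Delta E)^2 =", var_E)
```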

The energy of a body in thermal equilibrium with a heat bath at temperature
$T = 1/k_B\beta$ fluctuates because equilibrium is established by exchanging energy with
the heat bath. In as much as we expect the internal energy of a thermodynamic system
to be well defined, we expect $\Delta E \ll \langle E \rangle$. Below we shall check that this is so for
some specimen systems.

The specific heat at constant volume is

$$C_V = \left(\frac{\partial U}{\partial T}\right)_V = -\frac{\beta}{T}\frac{\partial U}{\partial \beta} = \frac{\beta}{T}\frac{\partial^2 \ln Z}{\partial \beta^2} = \frac{(\Delta E)^2}{k_B T^2}. \qquad (3.17)$$

Thus, the mean-square fluctuations in energy are proportional to the specific heat. The
key point is that the r.m.s. energy fluctuation is the geometric mean of $k_B T$, which
is small, and $C_V T$, which is generally of order U. During a phase change, $C_V$ may
become very large (even formally infinite) and fluctuations in E may be macroscopic
in size.
Exercise (2):

Use equations (3.7) and (3.17) to show that $C_V = -\beta(\partial S/\partial \beta)$ and explain the
significance of this result in terms of the usual thermodynamic definition of entropy.

3.0.2 Linear response theorem In the case of a magnetic system there will be a term
in $E_\alpha$ that is proportional to the externally applied magnetic field B: that is, $E_\alpha$ is of
the form

$$E_\alpha = \epsilon_\alpha - B m_\alpha \quad\Rightarrow\quad Z = \sum_\alpha e^{-\beta(\epsilon_\alpha - B m_\alpha)}, \qquad (3.18)$$

where $\epsilon_\alpha$ is the contribution to the energy of the state $|\alpha\rangle$ from internal interactions
and $m_\alpha$ is the dipole moment of $|\alpha\rangle$. Differentiating $\ln Z$ w.r.t. B and using (3.4) we
see that

$$M = \langle m_\alpha \rangle = \frac{1}{\beta}\frac{\partial \ln Z}{\partial B}. \qquad (3.19)$$

Differentiating again we have

$$\chi \equiv \frac{\partial M}{\partial B} = \beta\left(\langle m_\alpha^2\rangle - \langle m_\alpha\rangle^2\right). \qquad (3.20)$$

Thus the susceptibility $\chi$ is proportional to the mean-square fluctuation in the
magnetization. A similar proportionality holds for any system between the coefficient of
the linear response of a system to an external stimulus such as B and the amplitude of
the fluctuations in the response. This result is known as the linear response
theorem. One can understand it physically by considering that the fluctuations in $m_\alpha$ are
responses of the system to random stimuli from the heat bath with which the system
is in thermal contact. The more rapidly the system responds to a given stimulus, the
larger the resulting fluctuations are going to be.

3.0.3 Composite systems The system under study sometimes consists of two parts
that are mutually independent in the sense that the eigenstates of the system's
Hamiltonian are simply products $|\alpha\gamma\rangle = |\alpha\rangle|\gamma\rangle$ of eigenstates of the Hamiltonians $H_1$ and
$H_2$ of the parts, with eigenvalues $E_{\alpha\gamma} = E_\alpha^{(1)} + E_\gamma^{(2)}$ that are sums of the corresponding
eigenvalues for the parts. On calculating the partition function of the whole system
we find

$$\begin{aligned}
Z &= \sum_{\text{states } |\alpha\gamma\rangle} e^{-\beta E_{\alpha\gamma}} = \sum_{\alpha,\gamma} e^{-\beta\left(E_\alpha^{(1)} + E_\gamma^{(2)}\right)} \\
&= Z_1 Z_2. \qquad (3.21)
\end{aligned}$$

Thus the partition function of the composite system is simply the product of the
partition functions of its component parts and the probability $p_{\alpha\gamma}$ that the whole is in
the state $|\alpha\gamma\rangle = |\alpha\rangle|\gamma\rangle$ is $p_{\alpha\gamma} = p_\alpha^{(1)} p_\gamma^{(2)}$. By induction it immediately follows that the
partition function of a system of N independent parts is the product of the partition
functions of the parts. We have seen that most thermodynamic quantities depend on
$\ln Z$, so these will be the sum of the corresponding quantities for the component parts:
in the language of thermodynamics, quantities depending on $\ln Z$ will be extensive.
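
The factorization of Z for independent parts can be confirmed by brute force. In this minimal sketch the two spectra and the value of beta are made-up illustrative inputs; the composite spectrum is built by adding every pair of eigenvalues.

```python
import math
from itertools import product

def partition(energies, beta):
    return sum(math.exp(-beta * e) for e in energies)

part1 = [0.0, 1.0, 2.0]   # hypothetical eigenvalues of the first part
part2 = [-0.5, 0.5]       # hypothetical eigenvalues of the second part
beta = 0.9

# Eigenvalues of the whole: one sum E1 + E2 for every product state.
combined = [e1 + e2 for e1, e2 in product(part1, part2)]

z_whole = partition(combined, beta)
z_product = partition(part1, beta) * partition(part2, beta)
print(z_whole, z_product)
print("ln Z adds:", math.log(z_whole),
      math.log(partition(part1, beta)) + math.log(partition(part2, beta)))
```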
3.1 Spin-half paramagnet

We consider an array of N spin-half dipoles in a uniform magnetic field B. Each dipole
has two stationary states $|+\rangle$ and $|-\rangle$, with energies $\pm\mu B$, where $\mu$ is the magnetic
moment of a dipole. Consequently, the partition function of a single dipole is

$$Z^{(1)} = e^{\beta\mu B} + e^{-\beta\mu B} = 2\cosh(\beta\mu B). \qquad (3.22)$$

From (3.21) it now follows that the partition function of the array is

$$Z = 2^N \cosh^N(\beta\mu B) \quad\Rightarrow\quad \ln Z = N \ln[\cosh(\beta\mu B)] + \text{constant}. \qquad (3.23)$$

With (3.15) and (3.16) we have

$$U = \langle E \rangle = -N\mu B \tanh(\beta\mu B) ; \qquad (\Delta E)^2 = N(\mu B)^2\, \mathrm{sech}^2(\beta\mu B). \qquad (3.24)$$

At low temperatures, $\beta$ is very large and the tanh will be $\simeq -1$ for $\mu B < 0$ and $\simeq +1$
for $\mu B > 0$. Consequently, $U \simeq -N|\mu B|$ regardless of the sign of $\mu B$. As regards the
fluctuations in E we have

$$\left|\frac{\Delta E}{U}\right| = \frac{1}{\sqrt{N}\,|\sinh(\beta\mu B)|}, \qquad (3.25)$$

which is small if N is large provided $\beta\mu B \not\simeq 0$.
From (3.17) the specific heat is

$$C_V = N k_B (\beta\mu B)^2\, \mathrm{sech}^2(\beta\mu B). \qquad (3.26)$$

This is exponentially small when $\beta\mu B$ is large; when $\beta\mu B$ is small it is $\simeq N k_B (\beta\mu B)^2$. At low temperatures
(large $\beta\mu B$) the dipoles are all aligned with the field and increasing the temperature a
little does not induce a significant number to increase their energy by aligning against
the field. At high temperatures, nearly equal numbers of dipoles are aligned parallel
and antiparallel to B, so again increasing the temperature makes little difference to U.
The specific heat peaks when $k_B T \sim \mu B$, when increasing T allows significant numbers
of extra dipoles to align antiparallel to the field.

We have $F = -N k_B T \ln[2\cosh(\beta\mu B)]$ and

$$M = -\left(\frac{\partial F}{\partial B}\right)_T = \mu N \tanh(\beta\mu B). \qquad (3.27)$$

When $|\beta\mu B| \gg 1$, $\tanh(\beta\mu B) \simeq \pm 1$, so $M = \pm N\mu$ because the dipoles are nearly all
aligned with B. When $|\beta\mu B| \ll 1$, $M \simeq N\mu(\beta\mu B)$ is proportional to B and we have
a constant polarizability $\chi = dM/dB = N\mu^2\beta$. Notice the quadratic dependence of $\chi$
on the dipole moment $\mu$: one power of $\mu$ arises because the propensity of the dipoles
to align is proportional to $\mu$ and the other power arises because the magnetization
generated by an aligned dipole is proportional to $\mu$.
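
The closed-form results for the paramagnet are simple to evaluate. The sketch below is illustrative only: the function names, the units (mu B measured in energy units, beta in inverse energy units, k_B = 1) and the sample values are assumptions. It evaluates the energy, its variance and the magnetization, checks the relative fluctuation against 1/(sqrt(N)|sinh(beta mu B)|), and locates the peak of the specific heat per dipole on a grid of x = beta mu B.

```python
import math

def paramagnet(n, mu_b, beta):
    """Return (U, variance of E, M/mu) for n dipoles; mu_b is the product mu*B."""
    x = beta * mu_b
    energy = -n * mu_b * math.tanh(x)
    variance = n * mu_b ** 2 / math.cosh(x) ** 2
    moment_over_mu = n * math.tanh(x)
    return energy, variance, moment_over_mu

def heat_capacity_per_dipole(x):
    """C_V / (N k_B) as a function of x = beta*mu*B."""
    return (x / math.cosh(x)) ** 2

n, mu_b, beta = 10_000, 1.0, 0.5   # hypothetical sample values
U, var_E, M_over_mu = paramagnet(n, mu_b, beta)
rel_fluct = math.sqrt(var_E) / abs(U)

# Grid search for the maximum of the specific heat.
xs = [0.01 * i for i in range(1, 501)]
x_peak = max(xs, key=heat_capacity_per_dipole)

print("U =", U, " relative fluctuation =", rel_fluct)
print("specific heat peaks near beta*mu*B =", x_peak)
```

The peak sits where x tanh x = 1, i.e. at beta mu B of order unity, in line with the qualitative argument in the text.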
Exercise (3):

Obtain (3.27) from M = -dU/dB [cf. (1.7)].

We have seen that for N large E will lie in a narrow range around $U = \langle E \rangle$.
It is important to understand how this fact can be reconciled with the fact that the
probability that any given state $|\alpha\rangle$ is occupied declines exponentially with $E_\alpha$, so
at any temperature, the ground state is the one most likely to be occupied; yet if
you measure the energy, you are almost certain to get a value U that at high T
will significantly exceed the ground-state energy. This apparent paradox arises by
competition between the density of states, $g(E)$, which increases very strongly with E,
and the Boltzmann factor $e^{-\beta E}$, which eventually swamps its growth. Let's see how
this works out for the spin-half paramagnet.

Let there be $n_a$ dipoles aligned antiparallel to B and $N - n_a$ parallel. Then the
energy is

$$E = \mu B(2n_a - N) \quad\Rightarrow\quad n_a = \tfrac{1}{2}\left(\frac{E}{\mu B} + N\right) = \frac{E - E_0}{2\mu B}, \qquad (3.28)$$

where $E_0 = -N\mu B$ is the ground-state energy. Thus $n_a$ is a dimensionless measure of
the system's excitation energy. The number of states at a given energy is the number
of ways in which $n_a$ locations for antiparallel dipoles can be chosen from N locations,
i.e., $n_s = N!/[n_a!(N - n_a)!]$. When $n_a$ is increased by 1, E increases by $2\mu B$. Hence,
the density of states is

$$g(E_0 + 2\mu B n_a) = \frac{n_s}{2\mu B} = \frac{N!}{2\mu B\, n_a!\,(N - n_a)!} = \frac{1}{2\mu B}\,\frac{N}{1} \times \frac{N-1}{2} \times \cdots \times \frac{N - n_a + 1}{n_a}, \qquad (3.29)$$

so on increasing $n_a$ by one, $\ln g$ increases by $\ln[(N - n_a + 1)/n_a] \simeq \ln(N/n_a)$ for $n_a \ll N$.
On the other hand, $\ln(e^{-\beta E})$ decreases by $2\beta\mu B$ as $n_a$ increases by one, so the peak in the
product $g e^{-\beta E}$ occurs when $n_a \simeq N e^{-2\beta\mu B}$ and $E - E_0 \simeq 2N\mu B e^{-2\beta\mu B}$. From (3.24)
we have

$$U - E_0 = N\mu B[1 - \tanh(\beta\mu B)] \simeq 2N\mu B e^{-2\beta\mu B} \qquad (3.30)$$

in agreement with prediction. The magnetization starts to saturate when $\beta\mu B \sim 1$ and
at this point the probability for a state with $E_\alpha = U$ is smaller than the probability
of the ground state by a factor

$$\frac{p_\alpha(E_\alpha = U)}{p_\alpha(E_\alpha = E_0)} = \exp\left(-2N\beta\mu B\, e^{-2\beta\mu B}\right) \simeq e^{-0.27N}, \qquad (3.31)$$

which for $N \sim 10^{23}$ is smaller than $10^{-10^{22}}$.
Yet states with this incredibly small probability dominate Z because they are so
extraordinarily numerous.
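
The competition between the number of states and the Boltzmann factor can be made concrete. The sketch below is an illustration with assumed values of N and of x = beta mu B: it locates the number of flipped dipoles that maximizes the product of the binomial count of states and the Boltzmann factor, and compares it with the estimate N exp(-2 beta mu B), which is valid when the flipped fraction is small, and with the exact thermal mean N/(1 + exp(2 beta mu B)). The helper name `log_weight` is hypothetical.

```python
import math

def log_weight(n_a, n, x):
    """ln of [number of states with n_a flipped dipoles] * exp(-beta*(E - E0)), x = beta*mu*B."""
    log_binom = math.lgamma(n + 1) - math.lgamma(n_a + 1) - math.lgamma(n - n_a + 1)
    return log_binom - 2.0 * x * n_a

n, x = 100_000, 2.0   # hypothetical number of dipoles and value of beta*mu*B

n_peak = max(range(n + 1), key=lambda k: log_weight(k, n, x))
n_estimate = n * math.exp(-2.0 * x)          # small-fraction estimate from the text
n_mean = n / (1.0 + math.exp(2.0 * x))       # exact thermal mean of n_a

print("peak of g*exp(-beta*E) at n_a =", n_peak)
print("estimate N*exp(-2x)          =", n_estimate)
print("exact mean                   =", n_mean)
```

The peak lies far above the ground state (n_a = 0) even though each individual excited state is less probable than the ground state.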

3.2 Ideal Gases 



We now apply the canonical distribution to ideal gases. Undergraduate courses in 
quantum mechanics concentrate on non-relativistic, single-particle quantum mechanics.
This is fundamentally absurd since we are either interested in large numbers of
interacting non-relativistic particles (for example the electrons in a metal, or the atoms
in a crystal) or we are interested in a small number of ultra-relativistic particles (as
in an accelerator). You might guess that handling the dynamics of $10^{23}$ particles is
hard (we're making a stab at it right now), but that doing relativistic quantum mechanics
shouldn't be hard - once you're used to thinking in terms of events, relativistic
mechanics isn't any harder than the Newtonian kind; in fact Lorentzian covariance is 
more easily exploited than is Galilean invariance. The reason we don't teach relativis- 
tic quantum mechanics to undergraduates is that there is no such mechanics for single 
particles! To introduce quantum mechanics into relativity, you have to revolutionize 
your whole world-view and recognize that the Universe isn't made up of particles, but 
of fields. A particle is just an excitation of a field. One of the wonderful things about 
statistical mechanics is that it discovered this vital fact more than a generation be- 
fore high-energy physics did, and it made the discovery through experiments at low 
temperatures, i.e., very low energies. 

We're going to discuss the statistical mechanics of an 'ideal gas'. The system is a 
box full of a quantum field. Excitations of the field might be photons, or neutrinos, or 
helium atoms, or lithium atoms, whatever. But the physical reality is not a box full of 
particles, but a box full of quantum field.²

The field in the box has normal modes in which $\psi \propto A_{\mathbf{k}} e^{i\mathbf{k}\cdot\mathbf{x}}$. The wavevector
k of the mode determines the momentum of the particles through $\mathbf{p} = \hbar\mathbf{k}$ and the
amplitude $A_{\mathbf{k}}$ of the mode determines how many particles with this momentum are
present. If the field has half-integer spin (fermionic) there can be 0 or 1 particles. If the
field has integer spin (bosonic) there can be 0, 1, 2, . . . particles with any momentum
p. Let $i$ enumerate the normal modes of the box and let the energy of the $i$th normal
mode be $\epsilon_i = E_i^0 + n_i E_i$, where $E_i^0$ is the zero-point energy of the mode and $E_i$ is the
spacing of the mode's energy levels. (From the theory of a harmonic oscillator we
expect $E_i^0 = E_i/2$.) Then the partition function of the box is

$$Z = \sum_{\{n_i\}} e^{-\beta \sum_i (E_i^0 + n_i E_i)} = e^{-\beta\sum_i E_i^0} \sum_{\{n_i\}} \prod_i e^{-\beta n_i E_i}. \qquad (3.32)$$

We have seen that thermodynamic quantities depend on $\ln Z$, to which the zero-point
energies of the modes contribute a term

$$\Delta \ln Z = -\beta \sum_i E_i^0. \qquad (3.33)$$

The corresponding contribution to the internal energy is

$$\Delta U = -\frac{\partial}{\partial\beta} \Delta\ln Z = \sum_i E_i^0, \qquad (3.34)$$


² Some will find this an eccentric position: is there really a quantum field for lithium atoms? Two
replies are effective: (i) in recent experiments on Bose-Einstein condensates of Li and other atoms, 
the fundamental mode of the Li field in an atomic trap is manifest; (ii) at a deeper level a Li atom 
is a complex non-linear interaction of the quark, electron, photon and gluon fields. These fields are 
mildly relativistic even if the Li atom is stationary. Adding a Li atom involves increasing the number 
of excitations of the quark etc fields in well defined amounts, so the number of Li atoms is subject to 
the rules that govern excitations of relativistic fields. Z for a Li-atom trap can be considered to be a 
sum over carefully chosen states of the quark etc fields. 
which does not depend on temperature. By equation (3.7) there is no contribution
from the zero-point energies to S, while equation (3.8) implies that the contribution
to F is the same as that to U. Under most circumstances the contributions (3.34) to
U and F are not physically significant,³ and we shall ignore them in the following by
taking the partition function to be

$$Z = \sum_{\{n_i\}} \prod_i z_i^{n_i} \quad \text{where} \quad z_i \equiv e^{-\beta E_i}. \qquad (3.35)$$



It is easy to do the sum over the number sets $\{n_i\}$ in (3.35) so long as the only
restriction on the $n_i$ is that inherent in the field being bosonic or fermionic. The key
step is to recognize that

$$Z = \sum_{\{n_i\}} \prod_i z_i^{n_i} = \prod_i \Big( \sum_{n=0,\ldots} z_i^n \Big) =
\begin{cases}
\prod_i (1 + z_i) & \text{for fermions,} \\
\prod_i (1 - z_i)^{-1} & \text{for bosons,}
\end{cases} \qquad (3.36)$$



which implies that

$$\ln Z =
\begin{cases}
\sum_i \ln(1 + z_i) & \text{for fermions,} \\
-\sum_i \ln(1 - z_i) & \text{for bosons.}
\end{cases} \qquad (3.37)$$



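The mean number of particles in the $i$th mode is

$$\langle n_i \rangle = -\frac{1}{\beta}\frac{\partial \ln Z}{\partial E_i} =
\begin{cases}
\dfrac{1}{z_i^{-1} + 1} & \text{for fermions,} \\[2ex]
\dfrac{1}{z_i^{-1} - 1} & \text{for bosons.}
\end{cases} \qquad (3.38)$$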
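
The occupation numbers can be cross-checked against a direct average over the allowed values of n. The following sketch is illustrative: the function names, the sample mode energy and beta, and the truncation n_max of the boson sum are assumptions. The closed forms use z = exp(-beta E), giving z/(1 + z) for fermions and z/(1 - z) for bosons, which are the same expressions as 1/(1/z ± 1).

```python
import math

def mean_occupation(energy, beta, fermion):
    """Closed-form mean occupation with z = exp(-beta*E); bosons need energy > 0."""
    z = math.exp(-beta * energy)
    return z / (1.0 + z) if fermion else z / (1.0 - z)

def mean_occupation_direct(energy, beta, fermion, n_max=2000):
    """Same average from an explicit sum over n = 0, 1 (fermions) or n = 0..n_max (bosons)."""
    ns = range(2) if fermion else range(n_max + 1)
    weights = [math.exp(-beta * n * energy) for n in ns]
    return sum(n * w for n, w in zip(ns, weights)) / sum(weights)

energy, beta = 0.5, 1.0   # hypothetical mode energy and inverse temperature
for fermion in (True, False):
    label = "fermions" if fermion else "bosons"
    print(label,
          mean_occupation(energy, beta, fermion),
          mean_occupation_direct(energy, beta, fermion))
```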



The internal energy of the box is 

$$U = -\frac{\partial \ln Z}{\partial \beta} = \sum_i \langle n_i \rangle E_i. \qquad (3.39)$$



By (3.14) all the thermodynamic variables are obtainable from 

$$F = \mp k_B T \sum_i \ln(1 \pm z_i) \quad \text{(upper signs for fermions)}. \qquad (3.40)$$

³ $\Delta U$ will be physically manifest if the zero-point energies depend on some parameter. When the electron/positron
field is bounded by two grounded capacitor plates, the zero-point energies are functions of the distance $d$ between the
plates, so $\Delta U = \Delta U(d)$. In fact, $\Delta U$ decreases as $d$ decreases, so the plates attract one another. This attraction is
called the Casimir effect. Since the sum over modes in (3.34) should be continued at least
up to modes with wavelengths comparable to the Planck length $(G\hbar/c^3)^{1/2}$, $\Delta U$ should be large.
An important current problem is why $\Delta U$ does not manifest itself as a cosmological constant ~ 120
orders of magnitude larger than that measured experimentally.
To proceed further one has to specify the $E_i$. Black-body radiation is a simple
case: then $E_i = \hbar\omega_i = \hbar k_i c$. In evaluating the sums we argue that in the thermodynamic
limit the system (box) is very big, so its levels $E_i$ form an effective continuum. Then a
negligible fraction of any sum comes from the lowest few modes. (This statement would
be false if the side length L of the box was such that $\beta\hbar 2\pi c/L > 1$.) It is simplest
to impose periodic boundary conditions on $\psi$ at the walls of the box [$\psi(x + L, y, z) =
\psi(x, y, z)$], which implies that $k_x = 2m\pi/L$, where $m$ is an integer, and similarly for
$k_y$ and $k_z$. Then there is one
allowed value of k in the k-space volume $(\Delta k)^3 = (2\pi/L)^3 = (2\pi)^3/V$, and the density
of modes is given by

$$\text{no. of modes} = \text{spin factor} \times V \frac{d^3k}{(2\pi)^3} = 2V \frac{d^3k}{(2\pi)^3} \quad \text{(photons)}. \qquad (3.41)$$


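Hence we find for the free energy density of black-body radiation

$$\frac{F}{V} = \frac{k_B T}{\pi^2} \int_0^\infty dk\, k^2 \ln\left(1 - e^{-\beta\hbar k c}\right) = \frac{(k_B T)^4}{\pi^2 (\hbar c)^3} \int_0^\infty dx\, x^2 \ln\left(1 - e^{-x}\right). \qquad (3.42)$$

We integrate by parts to find

$$\frac{F}{V} = -\frac{(k_B T)^4}{3\pi^2(\hbar c)^3} \int_0^\infty dx\, \frac{x^3}{e^x - 1} = -\frac{\pi^2}{45} \frac{(k_B T)^4}{(\hbar c)^3}, \qquad (3.43)$$

where we have used the fact that the integral equals $\pi^4/15$. From (3.14) and U =
F + TS we find that the other thermodynamic variables are

$$\frac{S}{V} = -\frac{4F}{VT} \quad\Rightarrow\quad \frac{U}{V} = -\frac{3F}{V} ; \qquad P = -\frac{F}{V} = \frac{1}{3}\frac{U}{V}. \qquad (3.44)$$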



It is interesting to compare this result with that for a population of zero rest-mass 
neutrinos. The spin factor in (3.42) is now unity, so from the free-energy of fermions 
we have 



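$$\frac{F}{V} = -\frac{(k_B T)^4}{6\pi^2(\hbar c)^3} \int_0^\infty dx\, \frac{x^3}{e^x + 1}, \qquad (3.45)$$

where the integral evaluates to $-3! \sum_{n=1}^{\infty} (-1)^n/n^4 = 7\pi^4/120 = 5.682$. Since this expression differs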
from (3.43) only in the numerical factor (smaller for fermions by a factor 0.4375), 
the relations (3.44) between the thermodynamic variables apply also to zero rest-mass 
fermions. 
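
The two integrals and the quoted ratio can be verified numerically. The sketch below is one way to do it with the standard library only; the Simpson integrator, the cutoff at x = 60 and the number of subintervals are illustrative choices, not part of the notes. The factor 1/2 in the ratio reflects the spin factor of 1 (neutrinos) against 2 (photons).

```python
import math

def simpson(f, a, b, n):
    """Composite Simpson rule with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def bose_integrand(x):
    # x^3 / (e^x - 1); the limit at x = 0 is 0.
    return x ** 3 / math.expm1(x) if x > 0.0 else 0.0

def fermi_integrand(x):
    return x ** 3 / (math.exp(x) + 1.0)

# The integrands decay like x^3 e^{-x}, so truncating at x = 60 is harmless.
i_bose = simpson(bose_integrand, 0.0, 60.0, 6000)
i_fermi = simpson(fermi_integrand, 0.0, 60.0, 6000)
ratio = 0.5 * i_fermi / i_bose   # fermion F/V divided by photon F/V

print("boson integral  :", i_bose, " pi^4/15   =", math.pi ** 4 / 15)
print("fermion integral:", i_fermi, " 7pi^4/120 =", 7 * math.pi ** 4 / 120)
print("free-energy ratio:", ratio)
```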

3.2.1 Case of a specified number of particles We may know how many particles we
have in our box - this would be the case if the particles were electrons in a lump of metal,
or helium atoms in a dewar. The partition function (3.37) is too large because it
includes states in which the $n_i$ take all values, and we want to sum $e^{-\beta E_\alpha}$ only
over states for which $\sum_i n_i = N$.
Let $\zeta$ be a complex variable and consider the function

$$f(\zeta) \equiv \sum_{\{n_i\}} \prod_i \left(\zeta e^{-\beta E_i}\right)^{n_i} = \prod_i \left(1 \pm \zeta e^{-\beta E_i}\right)^{\pm 1} \quad (+ \text{ for fermions}). \qquad (3.46)$$

For $\zeta = 1$ this coincides with our unrestricted sum (3.36). More generally it is a power
series in $\zeta$ with one factor of $\zeta$ for each particle. Hence, the coefficient of $\zeta^N$ is precisely
the restricted sum we wish to evaluate.
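
For a handful of fermionic modes the claim about the coefficient of the N-th power can be checked exactly, without any contour integral. The sketch below is an illustration with a made-up set of mode energies: it expands the product of factors (1 + zeta z_i) as a polynomial in zeta and compares the coefficient of zeta^N with a brute-force sum over all ways of occupying N distinct modes. The function names are hypothetical.

```python
import math
from itertools import combinations

def restricted_sum(z, n_particles):
    """Sum over all fermionic occupations with exactly n_particles modes filled."""
    return sum(math.prod(c) for c in combinations(z, n_particles))

def poly_coefficients(z):
    """Coefficients of prod_i (1 + zeta*z_i), indexed by the power of zeta."""
    coeffs = [1.0]
    for zi in z:
        new = coeffs + [0.0]           # contribution of the '1' in (1 + zeta*z_i)
        for k in range(len(coeffs)):
            new[k + 1] += coeffs[k] * zi   # contribution of the 'zeta*z_i' term
        coeffs = new
    return coeffs

energies = [0.0, 0.3, 0.9, 1.4, 2.2]   # hypothetical mode energies
beta = 1.0
z = [math.exp(-beta * e) for e in energies]

coeffs = poly_coefficients(z)
for n_particles in range(len(z) + 1):
    print(n_particles, coeffs[n_particles], restricted_sum(z, n_particles))
```

Summing all the coefficients recovers the unrestricted fermion partition function, which corresponds to setting zeta = 1.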

One of Cauchy's theorems states that if $f(\zeta)$ is a function that is analytic in the
neighbourhood of the origin, then the contour integral

$$Z_N = \frac{1}{2\pi i} \oint d\zeta\, \frac{f(\zeta)}{\zeta^{N+1}} \qquad (3.47)$$

yields the coefficient of $\zeta^N$ in the expansion of $f$ in positive and negative powers of $\zeta$.
Here the contour of integration goes around the origin so as to include the singularity
at the origin (due to the factor $\zeta^{-(N+1)}$) and no other. Hence, our problem has been
reduced to doing a contour integral.

We evaluate the integral using a technique called "the method of steepest descent",
which is approximate for finite $N$ but becomes exact in the thermodynamic limit
$N \to \infty$. In this method we write our integral in the form $\oint d\zeta\, e^{g(\zeta)}$ and approximate
it by a phase factor times $e^{g}\sqrt{2\pi/|g''|}$ evaluated at a stationary point of $g$ (see Appendix
B). In our application the function $g$ is

$$g(\zeta) = -(N+1)\ln\zeta \pm \sum_i \ln\left(1 \pm \zeta e^{-\beta E_i}\right) \quad (+\text{ for fermions}). \tag{3.48}$$

Differentiating we find

$$g'(\zeta) = -\frac{N+1}{\zeta} + \sum_i \frac{e^{-\beta E_i}}{1 \pm \zeta e^{-\beta E_i}}. \tag{3.49}$$



For small real, positive $\zeta$, the first term in $g'$ dominates and $g' < 0$. Sufficiently far
from the origin, the sum term dominates because there are arbitrarily many modes
(and in the case of bosons the first denominator may become small). Hence at some
distance from the origin $g' > 0$ and $g$ has a stationary point when

$$\frac{N+1}{\zeta} = \sum_i \frac{e^{-\beta E_i}}{1 \pm \zeta e^{-\beta E_i}}. \tag{3.50}$$
Figure 1 $\langle n_i\rangle$ for a gas of fermions for two values of $E_F$.

Taking logs we then find with (3.47) that

$$\ln Z_N \simeq -(N+1)\ln\zeta \pm \sum_i \ln\left(1 \pm \zeta e^{-\beta E_i}\right) - \tfrac12\ln(2\pi g''). \tag{3.51}$$



Now we allow $N$ to increase as we approach the thermodynamic limit. Eq. (3.49)
shows that the value of $g''$ at the stationary point of $g$ grows with $N$ so the method of
steepest descent becomes ever more accurate. Moreover, in (3.51) the first term and
the sum grow as $N$, while the term involving $g''$ grows only as $\ln N$. Therefore we may
neglect the latter and write finally

$$\ln Z_N = -N\ln\zeta \pm \sum_i \ln\left(1 \pm \zeta e^{-\beta E_i}\right) \quad (\text{macroscopic } N) \tag{3.52a}$$

where from (3.49) $\zeta$ is the number for which

$$N = \sum_i \frac{1}{\zeta^{-1}e^{\beta E_i} \pm 1}. \tag{3.52b}$$

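To make contact with thermodynamics, we introduce a new variable $\mu$ that is
defined by $\zeta = e^{\beta\mu}$. Then we have

$$\ln Z_N = -N\beta\mu \pm \sum_i \ln\left(1 \pm e^{-\beta(E_i-\mu)}\right)\ ; \qquad N = \sum_i \frac{1}{e^{\beta(E_i-\mu)} \pm 1}. \tag{3.53}$$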



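The terms in the sum for $N$ are the $\langle n_i\rangle$, as one can show by evaluating $\langle n_i\rangle =
-\beta^{-1}\,\partial\ln Z_N/\partial E_i$, bearing in mind that through (3.53) $\mu$ is a function of the $E_i$. We
have

$$-\frac{1}{\beta}\frac{\partial\ln Z_N}{\partial E_i} = N\frac{\partial\mu}{\partial E_i} + \sum_j \frac{\delta_{ij} - \partial\mu/\partial E_i}{e^{\beta(E_j-\mu)} \pm 1}. \tag{3.54}$$

By the second of equations (3.53), the coefficient of $\partial\mu/\partial E_i$ vanishes, so

$$\langle n_i\rangle = \frac{1}{e^{\beta(E_i-\mu)} \pm 1} \quad (\text{upper sign for fermions}). \tag{3.55}$$

In the case of a gas of fermions, $\langle n_i\rangle \simeq 1$ for $E_i < \mu$ and $\langle n_i\rangle \simeq e^{-\beta(E_i-\mu)}$ for $E_i \gg \mu$
(see Fig. 1).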

In the case of a gas of bosons, $\langle n_i\rangle$ becomes very large as $E_i$ approaches $\mu$. Since
there can be no upper bound on $E_i$ and it is clear physically that $\langle n_i\rangle < N$, we infer
that for bosons $\mu < E_0$, the energy of the fundamental mode. For $E_i \gg \mu$ we
have $\langle n_i\rangle \simeq e^{-\beta(E_i-\mu)}$ as in the case of fermions.
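In practice the condition that the mean occupation numbers sum to N has to be solved numerically for $\mu$. The sketch below does this by bisection for a small, made-up set of equally spaced levels; the level spacing, $\beta$ and the bracketing interval are assumptions chosen for illustration only.

```python
import math

def occupation(E, mu, beta, sign):
    """Mean occupation 1/(exp(beta*(E - mu)) + sign); sign=+1 fermions, -1 bosons."""
    return 1.0 / (math.exp(beta * (E - mu)) + sign)

def solve_mu(levels, n_particles, beta, sign):
    """Bisection on mu so that the occupations of all levels sum to n_particles."""
    def total(mu):
        return sum(occupation(E, mu, beta, sign) for E in levels)

    lo = min(levels) - 50.0 / beta           # essentially empty system
    if sign > 0:
        hi = max(levels) + 50.0 / beta       # essentially every level filled
    else:
        hi = min(levels) - 1e-12             # bosons require mu < E_0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if total(mid) < n_particles:
            lo = mid                         # too few particles: raise mu
        else:
            hi = mid
    return 0.5 * (lo + hi)

levels = [float(k) for k in range(10)]       # hypothetical energy levels
beta = 2.0

mu_f = solve_mu(levels, 4, beta, +1)
mu_b = solve_mu(levels, 4, beta, -1)
print(mu_f, [round(occupation(E, mu_f, beta, +1), 3) for E in levels])
print(mu_b, [round(occupation(E, mu_b, beta, -1), 3) for E in levels])
```

The bisection relies on the total occupation being a monotonically increasing function of $\mu$; for bosons the upper end of the bracket sits just below the lowest level, in line with the bound $\mu < E_0$.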

Boltzmann believed that the number of molecules in the box with energy $E$ was
$\propto e^{-\beta E}$ times the phase-space volume associated with energy $E$, $\Omega(E)\,dE$. This proposition
is consistent with the limiting forms of $\langle n_i\rangle$ at $E \gg \mu$ providing the number of
states at energy $E$, $g(E)\,dE = \Omega(E)\,dE$. This is in fact the case.

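We now evaluate the sum over modes by assuming that we are working in the
thermodynamic limit of a large box, so the modes form a continuum. We assume that
our particles are non-relativistic,^4 have mass $m$ and two spin states. Then the density
of modes is given by (3.41), and we have $E_i = (\hbar k)^2/2m$, so (3.53) becomes

$$\frac{N}{V} = \frac{1}{\pi^2}\int_0^\infty dk\,\frac{k^2}{e^{\beta(E-\mu)} \pm 1} = \frac{\sqrt2\,m^{3/2}}{\pi^2\hbar^3}\int_0^\infty dE\,\frac{\sqrt{E}}{e^{\beta(E-\mu)} \pm 1} = \frac{\sqrt2\,(mk_{\rm B}T)^{3/2}}{\pi^2\hbar^3}\,f_\pm(\beta\mu), \quad\text{where}\quad f_\pm(y) \equiv \int_0^\infty dx\,\frac{\sqrt{x}}{e^{x-y} \pm 1}. \tag{3.56}$$

Figure 2 shows $f_\pm$.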

$N/V$ is a monotonic increasing function of $\mu$ because for both fermions (upper
sign) and bosons, increasing $\mu$ makes the denominator of the integrand smaller at any
$E$. In the case of fermions there is no upper bound on $\mu$, and $N/V$ can be made as
large as we please. In the limit of large $N/V$, $f(y) \simeq \tfrac23 y^{3/2}$ so $N/V \propto (k_{\rm B}T)^{3/2}f(\beta\mu) \propto \mu^{3/2}$,
and $\mu$ becomes independent of $T$. Quantitatively, $\mu$ tends to the Fermi energy

$$E_F \equiv \frac{(3\pi^2)^{2/3}\hbar^2}{2m}\left(\frac{N}{V}\right)^{2/3}. \tag{3.57}$$



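3.2.2 Bose-Einstein condensation. In the bosonic case, we have seen that the largest
allowed value of $\mu$ is zero, so there is a largest permitted value of $N/V$, namely

$$N_{\max}/V = \frac{\sqrt2\,(mk_{\rm B}T)^{3/2}}{\pi^2\hbar^3}\,f_-(0) \simeq 3.27\,\frac{(mk_{\rm B}T)^{3/2}}{\pi^2\hbar^3}. \tag{3.58}$$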

How are we to understand this? At large $T$, $N_{\max}$ is large, so let's stuff a good number
of bosons into our box at a high enough $T$ for this number to be permitted by (3.58).
Now let's cool the box. As $T$ drops to the critical value

$$T_c \simeq 2.09\,\frac{\hbar^2}{k_{\rm B}m}\left(\frac{N}{V}\right)^{2/3}, \tag{3.59}$$



^4 In the relativistic case, pair creation will ensure that N fluctuates rather than being fixed.
Figure 2 The function $f_\pm$ defined in eq. (3.56). At large negative $y$, $f \sim e^y$ for both
bosons and fermions. As $y \to 0$ the bosonic function $f_-$ tends to $\Gamma(3/2)\,\zeta(3/2) \approx 2.32$. At large $y$ the
fermionic function, $f_+$, asymptotes to $f = \tfrac23 y^{3/2}$ that is shown dotted.

$N_{\max} \to N$. What happens when $T$ drops below $T_c$? A macroscopic
number of particles crowds into the fundamental mode, and the approximation we
have implicitly made, that $\langle n_i\rangle$ is a continuous function of $E_i$, breaks down. In fact, at
$T < T_c$ the number $N_0$ in the fundamental mode is given by

$$N_0(T) = N - N_{\max}(T). \tag{3.60}$$

This crowding of particles into the fundamental mode is called Bose-Einstein condensation.
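The numbers entering the condensation criterion can be evaluated directly. The sketch below assumes the standard result $f_-(0) = \Gamma(3/2)\,\zeta(3/2)$ for $f_-(y) = \int_0^\infty dx\,\sqrt{x}/(e^{x-y}-1)$, two spin states as in the text, and the scaling $N_{\max} \propto T^{3/2}$ of (3.58), which together with (3.60) gives $N_0/N = 1 - (T/T_c)^{3/2}$; the function names are illustrative.

```python
import math

def zeta_three_halves(terms=100_000):
    """zeta(3/2): direct sum plus an integral (midpoint) estimate of the tail."""
    partial = sum(n ** -1.5 for n in range(1, terms + 1))
    # tail sum_{n > terms} n^-1.5 ~ integral from terms + 1/2 to infinity
    return partial + 2.0 / math.sqrt(terms + 0.5)

# f_-(0) = Gamma(3/2) * zeta(3/2)
F_MINUS_0 = math.gamma(1.5) * zeta_three_halves()

# Setting N/V = sqrt(2) (m k_B T_c)^{3/2} f_-(0) / (pi^2 hbar^3) and solving for
# T_c gives k_B T_c = TC_COEFF * hbar^2 (N/V)^{2/3} / m
TC_COEFF = (math.pi**2 / (math.sqrt(2.0) * F_MINUS_0)) ** (2.0 / 3.0)

def condensate_fraction(T, Tc):
    """N_0/N = 1 - N_max(T)/N with N_max proportional to T^{3/2}."""
    if T >= Tc:
        return 0.0
    return 1.0 - (T / Tc) ** 1.5

print(F_MINUS_0, TC_COEFF)
print(condensate_fraction(0.5, 1.0))
```

Halving the temperature below $T_c$ already places roughly two thirds of the particles in the fundamental mode.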

Exercise (4):

Show that at $T_c$, $L/\lambda = 0.997$, where $\lambda$ is the de Broglie wavelength associated
with $T_c$ and $L = (V/N)^{1/3}$ is the mean interparticle separation.

The condensation occurs only at low temperatures and high particle densities,
which are precisely the conditions under which inter-particle forces, which we have
neglected, are important. The lowest temperatures on Earth are attained in atomic
traps ($T \sim 1\,\mu$K), and in these condensation can be observed at densities low enough
($n \simeq 10^{20}\,{\rm m}^{-3}$) that inter-particle forces do not lead to the formation of clusters of
particles within the lifetime of experiments. Consequently, condensates that are well
described by our simple theory can be studied. For an accessible introduction to this
very active field, see Burnett et al., Physics Today, Dec 1999, p. 37.

Above about 2 K liquid $^4$He is a normal viscous liquid, called He I. But when $^4$He
is cooled through about 2 K (the exact temperature depends on the pressure), a 'superfluid'
component starts to form in the liquid; that is, below 2 K the liquid appears
to contain a component that flows without viscosity. This zero-viscosity component
coexists with ordinary, viscous $^4$He rather as water coexists with steam.^5 The lower
the temperature, the greater the proportion of $^4$He that is in the superfluid condensate
[cf (3.60)]. $^4$He that contains some of the superfluid condensate is called He II.

The superfluid condensate in He II is made up of $^4$He atoms with momenta so
small that their de Broglie wavelengths are macroscopic in size. These delocalized
atoms move coherently and are insensible of the small-scale structures which dominate
the scattering of normal, thermally excited helium atoms, for the same reason that
infrared radiation passes unscattered through a cloud of small water droplets.

In 1911 Kamerlingh Onnes discovered that mercury lost all trace of electrical re- 
sistivity when cooled through about 4.2 K. Many other materials have subsequently 
been found to become such superconductors at low temperatures, and in recent years 
this phenomenon has become of considerable technological importance. The standard 
BCS theory of superconductivity has it that at low temperatures electrons with op- 
positely aligned spins form Cooper pairs by exchanging phonons. Each Cooper pair 
is a charged, spin-zero particle. Since Cooper pairs are bound by phonons, which 
by electronic standards are slow-moving beasts, they form with near-zero momentum. 
Consequently, as soon as significant numbers of Cooper pairs are present, they con- 
stitute a Bose condensate, which is in some ways analogous to He II. Since this Bose 
condensate is charged and can flow without viscosity, all trace of electrical resistance 
vanishes when the condensate forms. 

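3.2.3 Degenerate matter. To derive the equation of state of degenerate matter we use
(3.53) to evaluate $\ln Z_N$ in the limit of a continuum of energy levels. The calculation
is closely analogous to that of (3.56):

$$F = -k_{\rm B}T\left(-N\beta\mu + \frac{\sqrt2\,m^{3/2}V}{\pi^2\hbar^3}\int_0^\infty dE\,\sqrt{E}\,\ln\left[1 + e^{-\beta(E-\mu)}\right]\right). \tag{3.61}$$

The integral is dominated by the portion at $E < \mu$ in which we can approximate
$\ln(1 + e^{-\beta(E-\mu)})$ by $\beta(\mu - E)$, so we have

$$F \simeq N\mu - \frac{\sqrt2\,m^{3/2}V}{\pi^2\hbar^3}\int_0^\mu dE\,\sqrt{E}\,(\mu - E) = N\mu - \frac{4\sqrt2\,m^{3/2}V\mu^{5/2}}{15\pi^2\hbar^3}. \tag{3.62}$$

From (3.56) and Fig. 2, in the degenerate limit $N/V$ is

$$\frac{N}{V} = \frac{2\sqrt2\,m^{3/2}\mu^{3/2}}{3\pi^2\hbar^3}. \tag{3.63}$$

Substituting this into (3.62) we find

$$F = \tfrac35 N\mu. \tag{3.64}$$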

^5 The analogy is inexact: in particular, no latent heat is involved in the passage of He atoms in
and out of the condensate.
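Taking logs and differentiating (3.63) w.r.t. $V$ we find that $d\mu/dV = -\tfrac23\mu/V$, so

$$P = -\left(\frac{\partial F}{\partial V}\right)_N = \frac25\frac{N}{V}\mu = \frac25\frac{N}{V}\,\frac{\hbar^2}{2m}\left(3\pi^2\frac{N}{V}\right)^{2/3} = \frac{(3\pi^2)^{2/3}\hbar^2}{5m}\left(\frac{N}{V}\right)^{5/3}. \tag{3.65}$$

Thus, for a degenerate gas $PV^\gamma = \text{const}$, where $\gamma = \tfrac53$, just as for any other ideal
monatomic gas; but $\gamma$ is not now the ratio of the principal specific heats.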

Let's apply these results to white dwarfs and neutron stars, in which the thermal
energy density is dominated by degenerate electrons and neutrons, respectively. Since
$F$ is independent of $T$, $S = -\partial F/\partial T = 0$ and $U = F$. The gravitational potential
energy $E_G$ of a star of mass $M$ and radius $R$ is $\simeq -2GM^2/5R$. Its equilibrium radius
is that at which $dE_G/dR = -dU/dR$. Now

$$\frac{dU}{dR} = \frac{dF}{dR} = \frac{\partial F}{\partial V}\frac{dV}{dR} = -4\pi R^2P, \tag{3.66}$$

so the equilibrium radius is that at which

$$\frac25\frac{GM^2}{R^2} \simeq 4\pi R^2P \quad\Rightarrow\quad 4\pi R^4P \simeq \tfrac25 GM^2. \tag{3.67}$$

We now use (3.65) to eliminate $P$ and put $M = Nm_{\rm H}$ to find

$$R \simeq \frac{3^{5/3}(3\pi/4)^{2/3}\hbar^2}{2Gm\,m_{\rm H}^2N^{1/3}} = 1.96\times10^{23}\,\frac{m_{\rm H}}{m}\,N^{-1/3}\ {\rm m}. \tag{3.68}$$

In a neutron star with $M \sim M_\odot = 2\times10^{30}$ kg, $N^{1/3} \simeq 10^{19}$, so $R \simeq 20$ km. The radius
of a white dwarf of the same mass should be larger by $m_{\rm H}/m_e \simeq 1800$, and thus be
$\sim 40\,000$ km. An important feature of (3.68) is the prediction that more massive stars
have smaller radii. This clearly implies that the Fermi energy rises with $N$. From (3.57)
we have that $E_F \propto (N/V)^{2/3}/m \propto 1/(R^2m) \propto m$, since we have seen that $R \propto m^{-1}$.
It turns out that as $M \to 1.4\,M_\odot$, $E_F \to mc^2$ for both white dwarfs and neutron
stars. Our calculations are invalid in this limit (because we adopted $E_i = \hbar^2k^2/2m$
rather than $E_i = c\sqrt{m^2c^2 + \hbar^2k^2}$). When relativistic formulae are used, one finds
that no equilibrium is possible for $M$ larger than a limit $\sim 1.4\,M_\odot$ that is known as
the Chandrasekhar mass.
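The radius estimate is easy to reproduce numerically. The sketch below assumes the balance $4\pi R^4P \simeq \tfrac25 GM^2$ together with the non-relativistic degenerate pressure $P = (3\pi^2)^{2/3}\hbar^2 (N/V)^{5/3}/5m$ and $M = Nm_{\rm H}$; the constants are rounded reference values and the function name is illustrative.

```python
import math

# Rounded physical constants in SI units (assumed values)
HBAR = 1.0546e-34   # J s
G = 6.674e-11       # m^3 kg^-1 s^-2
M_H = 1.6726e-27    # kg, taken equal to the proton mass
M_E = 9.109e-31     # kg
M_SUN = 2.0e30      # kg

# Numerical prefactor 3^{5/3} (3 pi / 4)^{2/3} / 2 of the radius formula
PREFACTOR = 3 ** (5 / 3) * (3 * math.pi / 4) ** (2 / 3) / 2

def degenerate_radius(mass, m_fermion):
    """Equilibrium radius (m) of a star supported by non-relativistic
    degenerate fermions of mass m_fermion, with N = mass / M_H."""
    n_particles = mass / M_H
    return PREFACTOR * HBAR**2 / (G * m_fermion * M_H**2 * n_particles ** (1 / 3))

# Coefficient of (m_H/m) N^{-1/3}, in metres
print(PREFACTOR * HBAR**2 / (G * M_H**3))

print(degenerate_radius(M_SUN, M_H) / 1e3, "km (neutron star)")
print(degenerate_radius(M_SUN, M_E) / 1e3, "km (white dwarf)")
```

Because $R \propto N^{-1/3}$, doubling the mass in this sketch shrinks the radius by a factor $2^{-1/3}$, which is the mass-radius trend noted in the text.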



4 The Grand Canonical Distribution 

An earthed conductor is free to exchange electrons with a large reservoir, so the number
of electrons in it will fluctuate thermally. Similarly, an iceberg that floats in water at
0°C can exchange water molecules with the ocean, so the number of water molecules
in the iceberg will fluctuate. So let's apply the principle of maximum entropy to the
case in which we add to the constraints (3.1) the constraint

$$\sum_\alpha p_\alpha n_\alpha = \langle N\rangle. \tag{4.1}$$
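Let the Lagrange multiplier used to apply this constraint on the $p_\alpha$ be denoted $\gamma$.
Then by analogy with (3.3) extremization of $S/k_{\rm B}$ yields

$$0 = -(\ln p_\alpha + 1) - \lambda - \beta E_\alpha - \gamma n_\alpha \tag{4.2}$$

and solving for $p_\alpha$ we have

$$p_\alpha = \frac{e^{-\beta E_\alpha - \gamma n_\alpha}}{\mathcal{Z}}, \tag{4.3}$$

where from $\sum_\alpha p_\alpha = 1$ we have

$$\mathcal{Z} = \sum_\alpha e^{-\beta E_\alpha - \gamma n_\alpha}. \tag{4.4}$$

$\mathcal{Z}$ is the grand-canonical partition function. We discover its physical interpretation
by evaluating the entropy of the probability distribution (4.3):

$$S/k_{\rm B} = -\sum_\alpha p_\alpha\ln p_\alpha = \sum_\alpha p_\alpha(\beta E_\alpha + \gamma n_\alpha) + \ln\mathcal{Z} = \beta U + \gamma\langle N\rangle + \ln\mathcal{Z} \quad\Rightarrow\quad U - TS + k_{\rm B}T\ln\mathcal{Z} = -\gamma k_{\rm B}T\langle N\rangle. \tag{4.5}$$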

To proceed further we need to recall the form taken by the laws of thermodynamics
for systems of variable mass. Thermodynamics is concerned with extensive
systems: if you double the size of the system you double its volume, entropy, and
energies (internal, Helmholtz and Gibbs).^6 Mathematically, in thermodynamics we
are concerned with systems for which

$$S_2 = 2S_1\ ; \quad V_2 = 2V_1\ ; \quad U_2 = U(S_2, V_2) = 2U(S_1, V_1) = 2U(S_2/2, V_2/2)\ ;$$
$$F_2 = F(T, V_2) = 2F(T, V_2/2)\ ; \quad G_2(T, P) = 2G_1(T, P). \tag{4.6}$$

These relations show that $G$ is the simplest energy to work with when the size of the
system is variable, because it is straightforwardly proportional to the amount of stuff
in the system. We define the chemical potential $\mu \equiv G/\langle N\rangle$ to be the free energy
of the system when it is reduced to a single particle, and have

$$G = U - TS + PV = \mu(T, P)\langle N\rangle. \tag{4.7}$$

Comparing this with (4.5), we see that the equations will be identical providing

$$PV = k_{\rm B}T\ln\mathcal{Z}\ ; \qquad \mu = -\gamma k_{\rm B}T. \tag{4.8}$$

^6 Real systems are usually not quite extensive: double the mass of a water droplet and you don't
quite double its internal energy because surface tension (a source of positive internal energy) is less
important in the larger drop than it was in the smaller.
With the second of these equations, we have

$$p_\alpha = \frac{1}{\mathcal{Z}}\,e^{-\beta(E_\alpha - \mu n_\alpha)}. \tag{4.9}$$

By analogy with (3.9) we define a new thermodynamic potential

$$\Phi \equiv -k_{\rm B}T\ln\mathcal{Z} = -PV, \tag{4.10}$$

where we have used (4.8). Now $\Phi = -PV = F - G = F - \mu\langle N\rangle$, so^7

$$d\Phi = dF - d\mu\,\langle N\rangle - \mu\,d\langle N\rangle = \left(-S\,dT - P\,dV + \mu\,d\langle N\rangle\right) - d\mu\,\langle N\rangle - \mu\,d\langle N\rangle = -S\,dT - P\,dV - \langle N\rangle\,d\mu. \tag{4.11}$$



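It follows that

$$S = -\left(\frac{\partial\Phi}{\partial T}\right)_{V,\mu}\ ; \qquad P = -\left(\frac{\partial\Phi}{\partial V}\right)_{T,\mu}\ ; \qquad \langle N\rangle = -\left(\frac{\partial\Phi}{\partial\mu}\right)_{T,V}. \tag{4.12}$$

Equations (4.10) and (4.12) enable us to obtain any required thermodynamic quantity
once we know $\mathcal{Z}(\beta, \mu, V)$.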

Exercise (5):

Express the derivatives in (4.12) in terms of the $p_\alpha$ and check that these expressions
make physical sense.

4.1 Application to a perfect gas 

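We can easily evaluate $\mathcal{Z}$ for a box full of ideal gas: we have $E_\alpha = \sum_i(C_i + n_iE_i)$,
where $i$ enumerates the modes of the box, and $n_\alpha = \sum_i n_i$, so

$$\mathcal{Z} = \sum_{\{n_i\}}\exp\Big[-\beta\Big(\sum_i(C_i + n_iE_i) - \mu\sum_i n_i\Big)\Big]. \tag{4.13}$$

As in our evaluation of $Z$ we neglect the $C_i$ and then with $\xi_i \equiv e^{-\beta(E_i-\mu)}$ we have

$$\mathcal{Z} = \sum_{\{n_i\}}\prod_i \xi_i^{n_i} = \prod_i (1 \pm \xi_i)^{\pm1}. \tag{4.14}$$

It follows that

$$\ln\mathcal{Z} = \pm\sum_i \ln\left(1 \pm e^{-\beta(E_i-\mu)}\right). \tag{4.15}$$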

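^7 When the system size can vary we have $dG = \langle N\rangle\,d\mu + \mu\,d\langle N\rangle = -S\,dT + V\,dP + \mu\,d\langle N\rangle$, so
$dF = d(G - PV) = -S\,dT - P\,dV + \mu\,d\langle N\rangle$ and $dU = d(F + TS) = T\,dS - P\,dV + \mu\,d\langle N\rangle$.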






From (4.13) it is clear that we can obtain $\langle n_i\rangle$, the expected number of excitations
(particles) in the $i$th mode, by differentiating $\ln\mathcal{Z}$ w.r.t. $E_i$:

$$\langle n_i\rangle = -\frac{1}{\beta}\frac{\partial\ln\mathcal{Z}}{\partial E_i} = \frac{1}{e^{\beta(E_i-\mu)} \pm 1} \quad (\text{upper sign for fermions}). \tag{4.16}$$

This result is identical to equation (3.54). It applies to a different physical problem,
however: eq. (3.54) applies when the number of particles is fixed at $N$ (hence to the
electrons in an isolated conductor), while (4.16) applies when the number of particles
is fluctuating thermally, and only its expectation value is equal to $\langle N\rangle$ (hence to the
electrons in a grounded conductor). For very large $\langle N\rangle$ one might expect the fluctuations
in $N$ to be much smaller than $\langle N\rangle$, and might not be surprised that $\langle n_i\rangle$ is
identical in the two cases. We have not, however, assumed that $\langle N\rangle$ is large in deriving
these results. Moreover, by differentiating $\ln\mathcal{Z}$ w.r.t. $\mu$ we can obtain an expression
for the fluctuations in $N$ and show that they may be non-negligible:

$$\langle(\Delta N)^2\rangle = \frac{1}{\beta^2}\frac{\partial^2\ln\mathcal{Z}}{\partial\mu^2} = \sum_i \frac{e^{\beta(E_i-\mu)}}{\left(e^{\beta(E_i-\mu)} \pm 1\right)^2}. \tag{4.17}$$



In the classical regime, $e^{\beta(E_i-\mu)} \gg 1$ and we have $\langle(\Delta N)^2\rangle \simeq \langle N\rangle$ as we might
naively expect from Poisson statistics. If a Bose condensate is present, the first term
in the sum for $\langle(\Delta N)^2\rangle$ is the square of the number of particles expected in the fundamental
mode, which is macroscopically large, so $\langle(\Delta N)^2\rangle$ is non-negligible.^8 In these
circumstances the agreement between (3.54) and (4.16) is remarkable.
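For a handful of fermionic modes the grand-canonical sum can be carried out explicitly, which gives a direct check on the occupation numbers and on the variance of N. The sketch below assumes five modes with made-up energies and compares brute-force averages over all occupation patterns with the closed-form expressions; the function names are illustrative.

```python
import math
from itertools import product

def brute_force(levels, mu, beta):
    """Average over every fermionic occupation pattern with weight
    exp(-beta*(E_alpha - mu*n_alpha))."""
    z = 0.0
    sum_n = [0.0] * len(levels)
    sum_N = 0.0
    sum_N2 = 0.0
    for occ in product((0, 1), repeat=len(levels)):
        n_tot = sum(occ)
        energy = sum(n * E for n, E in zip(occ, levels))
        w = math.exp(-beta * (energy - mu * n_tot))
        z += w
        sum_N += w * n_tot
        sum_N2 += w * n_tot**2
        for i, n in enumerate(occ):
            sum_n[i] += w * n
    mean_N = sum_N / z
    return [s / z for s in sum_n], mean_N, sum_N2 / z - mean_N**2

def closed_form(levels, mu, beta):
    """Fermi occupations and the corresponding variance of N."""
    x = [math.exp(beta * (E - mu)) for E in levels]
    occ = [1.0 / (xi + 1.0) for xi in x]
    var = sum(xi / (xi + 1.0) ** 2 for xi in x)
    return occ, sum(occ), var

levels = (0.0, 0.5, 1.0, 1.5, 2.0)   # hypothetical mode energies
print(brute_force(levels, mu=0.8, beta=1.5))
print(closed_form(levels, mu=0.8, beta=1.5))
```

Each term of the variance equals $\langle n_i\rangle(1 - \langle n_i\rangle)$ for fermions, so modes that are almost certainly full or almost certainly empty contribute little.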



4.2 Application to an e± plasma 

If you heat matter to temperatures $> 10^9$ K, the thermal energies of particles attain
values at which the creation of electron-positron pairs becomes a non-negligible process.
Let's adapt the grand-canonical formalism to this case. Now we don't wish to constrain
the number of electrons present, $n_\alpha$, but the difference $n_\alpha - m_\alpha$ between the number
of electrons and positrons in the box (this should be equal to the number of electrons
present when the box was cold). Consequently, we replace (4.1) by

$$\sum_\alpha p_\alpha(n_\alpha - m_\alpha) = \langle N\rangle. \tag{4.18}$$

It is easy to see that (4.4) then becomes

$$\mathcal{Z} = \sum_\alpha e^{-\beta E_\alpha - \gamma(n_\alpha - m_\alpha)}. \tag{4.19}$$

^8 For a degenerate Fermi gas, $e^{\beta(E_i-\mu)} \ll \langle n_i\rangle$, so $\langle(\Delta N)^2\rangle$ is negligible.
We have $E_\alpha = \sum_i(n_i + m_i)E_i$, $n_\alpha = \sum_i n_i$ and $m_\alpha = \sum_i m_i$, where the sums are
as usual over normal modes and $n_i$ and $m_i$ are the number of electrons and positrons
with energy $E_i$. Hence, (4.13) is replaced by

$$\mathcal{Z} = \sum_{\{n_i, m_i\}}\exp\Big[-\beta\Big(\sum_i\left[n_i(E_i - \mu) + m_i(E_i + \mu)\right]\Big)\Big]. \tag{4.20}$$

Here we have replaced $\gamma$ by $-\beta\mu$ as before. Notice that the first occurrence of $\mu$ in
(4.20) is the chemical potential of the electrons, and the second is minus the chemical
potential of the positrons; that is, we are asserting that the chemical potential of the
positrons is minus that of the electrons. This is an instance of the general rule that
when species $A + B + \cdots$ react together to produce species $a + b + \cdots$, the sum of
the chemical potentials on the left (here the chemical potentials of an electron and
a positron) has to equal the sum of the chemical potentials on the right (here two
photons). Finally for the electron-positron system (4.14) becomes

$$\mathcal{Z} = \prod_i\left(1 + e^{-\beta(E_i-\mu)}\right)\left(1 + e^{-\beta(E_i+\mu)}\right), \tag{4.21}$$

which is simply the product of the partition functions of a box full of electrons, with
that of a box full of positrons, with the positron chemical potential minus that of the
electrons. To proceed further we set

$$E_i = mc^2 + \frac{\hbar^2k^2}{2m} \tag{4.22}$$

and use (4.16) to sum $\langle n_i\rangle$ over normal modes. The expression we obtain is just the
first equality of (3.56) with $E_i(k)$ given by (4.22), and the second equality of (3.56)
with $\mu$ replaced by $\pm\mu - mc^2$ (upper sign electrons). Thus the electron and positron
densities are

$$N_\mp/V = \frac{1}{\pi^2}\int_0^\infty dk\,\frac{k^2}{e^{\beta(E\mp\mu)} + 1} = \frac{\sqrt2\,(mk_{\rm B}T)^{3/2}}{\pi^2\hbar^3}\,f_+[\beta(\pm\mu - mc^2)]. \tag{4.23}$$

The value of $\mu$ is now determined from

$$\langle N\rangle = N_- - N_+ = \frac{\sqrt2\,V(mk_{\rm B}T)^{3/2}}{\pi^2\hbar^3}\left\{f_+[\beta(\mu - mc^2)] - f_+[\beta(-\mu - mc^2)]\right\}. \tag{4.24}$$

The case of greatest interest is that in which the electrons are degenerate [$f_+(y) \simeq
\tfrac23 y^{3/2}$], and the positrons are in the classical limit [$f_+(y) \simeq \tfrac12\sqrt{\pi}\,e^y$]. Then

$$\frac{N_+}{N_-} = \frac{f_+[\beta(-\mu - mc^2)]}{f_+[\beta(\mu - mc^2)]} \simeq \frac{3\sqrt{\pi}\,e^{-\beta(\mu + mc^2)}}{4[\beta(\mu - mc^2)]^{3/2}}. \tag{4.25}$$

We have seen that in a white dwarf $\mu \sim mc^2$ when rest-mass energy is neglected, or
$\mu \sim 2mc^2$ when it's included, so (4.25) shows that the positron density is small until
$k_{\rm B}T > 3mc^2$, as one might naively expect.
5 The Microcanonical distribution

We have seen that unless $C_V$ is anomalously large, the fluctuations in $E$ are $\sim \sqrt{k_{\rm B}TU}$,
so they become negligible in the thermodynamic limit ($U \to \infty$). This result implies
that the partition function is dominated by terms with energies in a very narrow band
in $E$. Let's investigate the approximation in which we assume that the only states that
are occupied are those with $E = U$. If there are $\Omega(U)$ such states, our approximate
value of the partition function is

$$Z = \Omega(U)\,e^{-\beta U}. \tag{5.1}$$

In this approximation

$$p_\alpha = \begin{cases} 0 & \text{for } E_\alpha \neq U, \\ 1/\Omega & \text{for } E_\alpha = U. \end{cases}$$

Consequently, the entropy is

$$S = -k_{\rm B}\sum_\alpha p_\alpha\ln p_\alpha = k_{\rm B}\ln\Omega. \tag{5.2}$$

The probability distribution (5.2) and its consequences are called the microcanonical 
distribution. 

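In calculations based on the canonical and grand-canonical distributions, one tends
to consider $T$ (or $\beta$) to be given, and $U$ to be something that one determines from
it through $Z$. When working with the microcanonical distribution one regards $U$ as
given since $\Omega$ is a function of $U$ and by (5.2) $S$ is a function of $\Omega$. To find the value of
$\beta$ that matches the given $U$ we argue from $dU = T\,dS - P\,dV$ that

$$\beta = \frac{1}{k_{\rm B}}\left(\frac{\partial S}{\partial U}\right)_V = \left(\frac{\partial\ln\Omega}{\partial U}\right)_V. \tag{5.3}$$

We find the pressure from

$$P = T\left(\frac{\partial S}{\partial V}\right)_U = k_{\rm B}T\left(\frac{\partial\ln\Omega}{\partial V}\right)_U. \tag{5.4}$$
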
Exercise (6):

Show that $F = -k_{\rm B}T\ln Z$, with $Z$ from (5.1), yields the same expression for $F$ as
$F = U - TS$ with $S$ from (5.2).

5.1 Dynamics of a rubber band 

As an instructive application of the microcanonical distribution, consider a simple
model of a rubber band. We imagine the band to consist of $N$ links of length $l$ joined
end to end. Let the band be laid along the x-axis with one end at the origin, and
the other at $x = L > 0$. Some links point towards increasing $x$, and some towards
decreasing $x$. Let there be $n_+$ pointing to increasing $x$ and $n_-$ pointing towards
decreasing $x$, where $n_+ + n_- = N$. Since the ends are distance $L$ apart, we have
$n_+ - n_- = L/l$. A 'state' of the band consists of a particular arrangement of forward-
and backward-pointing links. There are

$$\Omega = \frac{N!}{n_+!\,n_-!} = \frac{N!}{[\tfrac12(N + L/l)]!\,[\tfrac12(N - L/l)]!} \tag{5.5}$$

ways of arranging the required number of forward- and backward-pointing links, so
with Stirling's approximation and $y \equiv L/l$, the band's entropy is

$$S/k_{\rm B} = \ln\Omega \simeq N(\ln N - 1) - \tfrac12(N + y)\{\ln[\tfrac12(N + y)] - 1\} - \tfrac12(N - y)\{\ln[\tfrac12(N - y)] - 1\}$$
$$= N\ln N - \tfrac12(N + y)\ln(N + y) - \tfrac12(N - y)\ln(N - y) + N\ln 2. \tag{5.6}$$

Differentiating w.r.t. $y$ we find

$$\frac{\partial(S/k_{\rm B})}{\partial y} = -\tfrac12\ln(N + y) + \tfrac12\ln(N - y) = -\tfrac12\ln\left(\frac{N + y}{N - y}\right). \tag{5.7}$$

According to the equation analogous to (5.4) for a system under tension, the tension
in the band is

$$\text{tension} = -T\left(\frac{\partial S}{\partial L}\right)_U = \frac{k_{\rm B}T}{2l}\ln\left(\frac{N + y}{N - y}\right) = \frac{k_{\rm B}T}{l}\left[\epsilon + O(\epsilon^3)\right] \quad\text{where}\quad \epsilon \equiv \frac{y}{N} = \frac{L}{Nl}, \tag{5.8}$$

in accordance with Hooke's law. Unfortunately, for this system one cannot use (5.3)
to relate the internal energy to the temperature, because it is impossible to vary $S$ at
fixed $y$.
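The quality of Stirling's approximation in this model can be judged by computing $\ln\Omega$ exactly. The sketch below assumes $\Omega = N!/(n_+!\,n_-!)$ with $n_\pm = \tfrac12(N \pm y)$, evaluates the factorials with the log-gamma function, and obtains the tension $-T\,\partial S/\partial L$ from a centred difference over one link flip; units with $k_{\rm B}T = l = 1$ and all function names are choices made for illustration.

```python
import math

def ln_omega(n_links, y):
    """Exact ln[N!/(n_+! n_-!)] with n_(+/-) = (N +/- y)/2, via log-gamma."""
    n_plus = 0.5 * (n_links + y)
    n_minus = 0.5 * (n_links - y)
    return (math.lgamma(n_links + 1)
            - math.lgamma(n_plus + 1)
            - math.lgamma(n_minus + 1))

def tension_exact(n_links, y, kT=1.0, link=1.0):
    """-T dS/dL from a centred difference; flipping one link changes y by 2."""
    dlnomega_dy = (ln_omega(n_links, y + 2) - ln_omega(n_links, y - 2)) / 4.0
    return -kT / link * dlnomega_dy

def tension_stirling(n_links, y, kT=1.0, link=1.0):
    """(k_B T / 2l) ln[(N + y)/(N - y)], the Stirling-approximation result."""
    return kT / (2.0 * link) * math.log((n_links + y) / (n_links - y))

N = 10_000
for y in (100, 2000, 6000):
    eps = y / N
    print(eps, tension_exact(N, y), tension_stirling(N, y))
```

At small extension both tensions are close to $\epsilon$ in these units (Hooke's law), while at large extension the logarithm makes the band stiffen.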



5.2 Fluctuations 

The density within a fluid in thermal equilibrium fluctuates; the blue of a clear sky
arises through the scattering of blue light off just such fluctuations in the density of
the atmosphere. Imagine that you are using an optical laser to measure these fluctuations.
Light is scattered by fluctuations in the density $\rho(\mathbf{x})$, so you seek the probability
functional^9 $P[\delta\rho]$ that will return the probability associated with any given form $\delta\rho(\mathbf{x})$
of the difference between the density and its mean value. Since one cannot see atoms
with visible light, you argue that $P$ should not involve details of the structure and
disposition of individual gas molecules, but should be expressible in terms of macroscopic,
phenomenological parameters such as the temperature $T(\mathbf{x})$, compressibility
$\kappa_T(\mathbf{x})$, or whatever, averaged within volumes large compared with the wavelength of
visible light: volumes which contain hundreds of millions of atoms. Correspondingly,
$\delta\rho(\mathbf{x})$ is to be interpreted as the mean value of the density difference within a volume of
this type centred on $\mathbf{x}$.

^9 A function $f(x)$ is a number that depends on a number. A functional $P[f]$ is a number that
depends on a whole function. It is conventional to enclose the argument of a functional in square
brackets. Often the function is given a dummy argument as well for clarity thus: $P[f(x)]$.

Einstein offered an early solution to this problem of finding the probability functional
$P[\delta\rho]$. He argued that the probability $P$ of a macroscopic configuration is
proportional to the phase-space volume compatible with it. Equation (5.2) enables
us to connect this to the entropy $S$ of the density field $\delta\rho$:

$$P[\delta\rho] \propto \Omega = e^{S/k_{\rm B}}. \tag{5.9}$$



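To find $S[\delta\rho]$ one imagines starting with the mean density distribution $\delta\rho = 0$ and
changing the energy and specific volume of each fluid element until the desired form
$\delta\rho(\mathbf{x})$ has been set up. Let lower-case letters $u$, $v$, etc., denote internal energy, volume,
etc. per unit mass. Then from $T\,dS = dU + P\,dV$ we have that on setting up $\delta\rho(\mathbf{x})$ the
net entropy changes by

$$\Delta S = \int d^3x\,\rho(\mathbf{x})\int_0^1 d\alpha\,\delta s(\mathbf{x}) = \int d^3x\,\rho(\mathbf{x})\int_0^1 \frac{d\alpha}{T}\left[\delta u + P\,\delta v\right], \tag{5.10}$$

where $\alpha$ parameterizes the various stages through which we deform the fluid to overdensity
$\delta\rho$. Since the whole system is to be regarded as a microcanonical ensemble, the
total energy $U = \int d^3x\,\rho u$ and the total volume are constant during the deformation
parameterized by $\alpha$. So if $T(\mathbf{x})$ and $P(\mathbf{x})$ were constant, the right-hand side of (5.10)
would vanish. So we write

$$T(\mathbf{x}, \alpha) \simeq T(\mathbf{x}, 0) + \alpha\,\delta T\ ; \qquad P(\mathbf{x}, \alpha) \simeq P(\mathbf{x}, 0) + \alpha\,\delta P \tag{5.11}$$

and have to second order in $\delta$

$$\Delta S = \int d^3x\,\rho(\mathbf{x})\int_0^1 d\alpha\,\alpha\left[-\frac{\delta T(\delta u + P\,\delta v)}{T^2} + \frac{\delta P\,\delta v}{T}\right] = -\tfrac12\int d^3x\,\rho(\mathbf{x})\left[\frac{\delta T\,\delta s}{T} - \frac{\delta P\,\delta v}{T}\right]. \tag{5.12}$$

When we now eliminate $\delta s$ and $\delta P$ through

$$\delta s = \frac{c_v}{T}\,\delta T + \left(\frac{\partial s}{\partial v}\right)_T\delta v\ ; \qquad \delta P = \left(\frac{\partial P}{\partial T}\right)_v\delta T - \frac{1}{\kappa_Tv}\,\delta v, \tag{5.13}$$

a Maxwell relation enables us to cancel the terms proportional to $\delta T\,\delta v$, and we have

$$\Delta S = -\int d^3x\,\frac{\rho}{2T}\left[\frac{c_v}{T}(\delta T)^2 + \frac{1}{\kappa_Tv}(\delta v)^2\right]. \tag{5.14}$$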

This equation expresses the change in the overall entropy as a function of the magnitude
of fluctuations in the local temperature and density. Substituting this into equation
(5.9) we obtain a probability functional $P[\delta\rho, \delta T]$. To obtain the desired functional
$P[\delta\rho]$ we must sum $P[\delta\rho, \delta T]$ over all configurations in which $\delta\rho(\mathbf{x})$ has a particular
form; in other words, we have to integrate $P[\delta\rho, \delta T]$ over all possible forms of $\delta T(\mathbf{x})$.
This integration merely generates a constant that is independent of $\delta\rho(\mathbf{x})$. Hence,
using $v = 1/\rho$ so that $\delta v = -\delta\rho/\rho^2$, Einstein's final probability functional is

$$P[\delta\rho] \propto \exp\left[-\int d^3x\,\frac{(\delta\rho)^2}{2k_{\rm B}T\kappa_T\rho^2}\right]. \tag{5.15}$$

According to this formula, density fluctuations have a Gaussian distribution with variance
proportional to the isothermal compressibility $\kappa_T$. This diverges at the critical
point $T = T_c$, so the fluctuations also diverge there giving rise to the phenomenon of
'critical opalescence'.

5.3 General remarks 

You have seen many examples of the microcanonical distribution in the Part A course, 
so I won't repeat them here. Notice that the philosophical role of the microcanoni- 
cal distribution in the Part A course is entirely different to that here: there it is the 
axiomatic starting point for statistical mechanics, and the canonical distribution is de- 
rived from it by the method of 'ensembles' in which one imagines a dynamical system, 
or ensemble, that is made of a large number of copies of the system of experimen- 
tal interest. When one fixes the energy of the ensemble and assumes that it obeys 
the microcanonical distribution, one can deduce that each copy obeys the canonical 
distribution. 

The approach taken here is that entropy maximization provides the only secure
foundation for statistical mechanics. The canonical and grand canonical distributions
follow trivially from it. The microcanonical distribution is a crude approximation to
the canonical one in that the $p_\alpha$ are not well approximated by (5.2). Despite this, these
$p_\alpha$ do give accurate results for most macroscopically measurable thermal averages.

6 Strongly interacting systems 

The models we have been discussing are of limited interest because they omit a key 
aspect of real systems: inter-molecular forces. Such forces make for much richer ther- 
modynamic behaviour, and it is very interesting to understand how complex thermo- 
dynamics of the bulk emerges from simple microscopic interactions. Understanding 
this question also provides insights into related economic and social phenomena. 

6.1 Models 

We discuss models of a magnetic material, a non-ideal gas and a binary alloy that 
all have the same mathematical structure. The dimensionality of the system proves 
to have a profound influence on the thermodynamics, so we introduce a parameter 
d = 1, 2, 3 that gives the dimensionality of the system: d = 1 implies a linear chain of 
sites, d = 2 a planar array of sites, etc. 




6.1.1 The Ising model. Far and away the most influential model of a system with
strong interactions is the Ising model. It was first solved by E. Ising in 1925, who
treated the case $d = 1$. In 1944 Onsager (1903-1976) solved the model for $d = 2$ in the
absence of an externally applied magnetic field and showed that at its phase transition
the model differed significantly from the predictions of 'Landau theory', which had
until then been thought correct. An exact solution for the $d = 2$ model in non-zero
external field appeared only in 1989.^10 Despite decades of intensive effort, we still have
no exact solution for the $d = 3$ model even in the absence of magnetic field.

The lattice is cubic, and associated with each point of the lattice is a number $s$
that is either 1 or $-1$. The system's Hamiltonian is

$$H = -B\sum_i s_i + \tfrac12\sum_{ij}J_{ij}s_is_j, \tag{6.1}$$

where $B$ is an externally imposed field, the subscripts label lattice sites, and $J_{ij}$ is
defined such that

$$J_{ij} = \begin{cases} J, & i \text{ and } j \text{ neighbouring sites}, \\ 0, & \text{otherwise}. \end{cases} \tag{6.2}$$

The model's partition function can now be written

$$Z_{\rm Ising} = \sum_{\{s_i\}}\exp\Big[\beta\Big(B\sum_i s_i - \tfrac12\sum_{ij}J_{ij}s_is_j\Big)\Big], \tag{6.3}$$

where $\{s_i\}$ indicates that the sum should be extended over all possible assignments of
$\pm1$ to lattice sites. The physical picture is of an array of magnets that are obliged to
be either parallel or anti-parallel to a uniform magnetic field $B$.



Note:

It is tempting to replace the magnets by spin-half dipoles and to associate $\hbar s_i/2$
with the eigenvalues of the spin-half operator $\mathbf{s}$. This is erroneous, however. The
interaction Hamiltonian of two spin-half dipoles is $(4J/\hbar^2)\,\mathbf{s}_i\cdot\mathbf{s}_j$, and if $\mathbf{S} \equiv \mathbf{s}_i + \mathbf{s}_j$
is the spin operator for the pair of atoms, it is easy to show that $\mathbf{s}_i\cdot\mathbf{s}_j = \tfrac12(S^2 - s_i^2 - s_j^2)$.
The eigenvalues of $S^2$ are 0 (non-degenerate) and $2\hbar^2$ (3-fold degenerate),
and $s_i^2 = s_j^2 = \tfrac34\hbar^2$. Hence the partition function for a pair of spin-half dipoles is
$Z = e^{3\beta J} + 3e^{-\beta J}$, which differs significantly from that of an Ising chain of two
dipoles, $Z = 2e^{\beta J} + 2e^{-\beta J}$.

If in equation (6.2) we set $J < 0$, neighbouring spins try to align parallel to one
another and parallel to $B$: the model is a ferromagnet. If we set $J > 0$, neighbouring
spins try to align anti-parallel to one another and (6.3) becomes the partition function
of an anti-ferromagnet. One may show that the thermodynamic properties of an
anti-ferromagnetic system are identical with those of the corresponding ferromagnetic
system.



^10 Zamolodchikov, Int. J. Mod. Phys., A4, 4235.






6.1.2 The lattice gas. The Ising model turns out to be mathematically equivalent to
the following highly stylized model of a non-ideal gas.

We divide the d-dimensional space occupied by the gas up into cells of just the
same size as an individual molecule. Each molecule is obliged to occupy a single cell,
and no cell may contain more than one molecule. Since the gas is non-ideal, molecules
attract each other and the energy of the gas is lower when molecules are in adjacent
cells than when each lives in glorious isolation. We model this state of affairs by
changing the energy of the gas by $4J < 0$ for every pair of molecules in adjacent cells.
Let $e_i$ be zero if the cell $i$ is vacant and one otherwise. Then inter-molecular attraction
changes the system's energy by

$$2\sum_{ij}J_{ij}e_ie_j, \tag{6.4}$$

where $J_{ij}$ is defined by equation (6.2). With these definitions, the grand partition
function of the gas is

$$\mathcal{Z} = \sum_{\{e_i\}}\exp\Big(\beta\mu\sum_i e_i - 2\beta\sum_{ij}J_{ij}e_ie_j\Big), \tag{6.5}$$

where $\mu$ is the chemical potential of the gas and $\{e_i\}$ indicates that the sum should be
extended over all possible assignments of zeros and ones to the cells.

The grand partition function (6.5) of the lattice gas is fundamentally identical
with the partition function (6.3) of the Ising model. We show this by eliminating the
$e_i$ in favour of the variables $s_i$ defined by

$$s_i = 2e_i - 1. \tag{6.6}$$

In terms of the $s_i$ equation (6.5) reads

$$\mathcal{Z} = \sum_{\{s_i\}}\exp\Big[\beta\Big((\tfrac12\mu - zJ)\sum_i s_i - \tfrac12\sum_{ij}J_{ij}s_is_j\Big) + \beta N(\tfrac12\mu - \tfrac12zJ)\Big], \tag{6.7}$$

where $N$ is the total number of cells and we have assumed that each cell has $z$ nearest
neighbours. ($z$ is known as the coordination number. On a d-dimensional cubic
lattice $z = 2d$.) Apart from the multiplicative factor $\exp[\beta N(\tfrac12\mu - \tfrac12zJ)]$, (6.7) is
identical with equation (6.3) for the partition function of the Ising model when $B =
\tfrac12\mu - zJ$.
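The mapping between the lattice gas and the Ising model can be confirmed by brute force on a small system. The sketch below assumes a one-dimensional ring of cells ($z = 2$), sums over ordered pairs $ij$ so that each adjacent pair contributes $4J$ to the gas energy, and checks that the grand partition function of the gas equals $\exp[\beta N(\tfrac12\mu - \tfrac12zJ)]$ times the Ising partition function at $B = \tfrac12\mu - zJ$; the parameter values and function names are arbitrary illustrations.

```python
import math
from itertools import product

def z_lattice_gas(n_cells, beta, mu, J):
    """Sum over occupations e_i in {0, 1} on a ring; each adjacent occupied
    pair changes the energy by 4J (ordered-pair sum with prefactor 2)."""
    total = 0.0
    for e in product((0, 1), repeat=n_cells):
        pairs = sum(e[i] * e[(i + 1) % n_cells] for i in range(n_cells))
        total += math.exp(beta * mu * sum(e) - 4.0 * beta * J * pairs)
    return total

def z_ising(n_cells, beta, B, J):
    """Sum over spins s_i = +/-1 on a ring with H = -B sum_i s_i + J sum_bonds s s."""
    total = 0.0
    for s in product((-1, 1), repeat=n_cells):
        bonds = sum(s[i] * s[(i + 1) % n_cells] for i in range(n_cells))
        total += math.exp(beta * (B * sum(s) - J * bonds))
    return total

n_cells, beta, mu, J = 6, 0.7, -0.3, -0.25   # assumed test values; needs n_cells >= 3
z = 2                                        # coordination number of a ring

gas = z_lattice_gas(n_cells, beta, mu, J)
ising = z_ising(n_cells, beta, B=0.5 * mu - z * J, J=J)
factor = math.exp(beta * n_cells * (0.5 * mu - 0.5 * z * J))

print(gas, factor * ising)
```

The two printed numbers agree to rounding error because the substitution $s_i = 2e_i - 1$ is an exact change of variables, not an approximation.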

6.1.3 β-brass  β-brass is an alloy consisting of equal numbers of copper and zinc atoms.
At $T = 0$ the alloy consists of two interpenetrating cubic lattices, one of copper and one
of zinc atoms, arranged such that each copper atom is surrounded by eight zinc atoms,
and vice versa for each zinc atom. As the temperature is raised, more and more copper
atoms stray onto the zinc sub-lattice and vice versa, until at 739 K the division into
two distinct sub-lattices breaks down altogether; above 739 K both sub-lattices contain
equal numbers of each kind of atom (see Figure 3). This system can be represented by
the Ising model as follows.

Figure 3 Projection of a slice through the lattice of β-brass, well below (left) and near
(right) the critical temperature.

At low temperatures the system is ordered because it is energetically preferable
for unlike atoms to be nearest neighbours rather than like atoms. Suppose the system's
energy is lowered by an amount $J$ for every bond between unlike atoms on adjacent
sites, and raised by $J'$ for every bond between like atoms on adjacent sites. By a
suitable choice of the arbitrary zero point of the energy scale we can ensure that
$J = J'$.

Now we set the order parameter $s_i$ on the $i$th site to $+1$ if the site is occupied by
a copper atom and to $-1$ if it is occupied by a zinc atom. Then the system's partition
function becomes

$$Z = \sum_{\{s_i\}} \exp\Big(-\tfrac12\beta\sum_{ij} J_{ij} s_i s_j\Big), \tag{6.8}$$

which is identical with equation (6.3) for the Ising model's partition function in the
case $B = 0$.



6.2 Evaluating $Z_{\rm Ising}$



We have a chain of $N$ dipoles, $s_i$, that interact only with their neighbours on each side.
To keep everything symmetrical, we assume that the chain is wrapped round on itself
to form a ring. So dipole $i$ interacts with dipoles $i+1$ and $i-1$, and we consider that
$s_N = s_0$. We consider first the case $B = 0$. The partition function of the system is

$$Z = \sum_{\{s_i\}} \exp\Big(-\sum_{i=0}^{N-1}\beta J s_i s_{i+1}\Big) = \sum_{\{s_i\}} \prod_{i=0}^{N-1} e^{-\beta J s_i s_{i+1}}. \tag{6.9}$$



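Now consider the expression for the $N$th power of a $2\times2$ matrix $T_{\sigma\tau}$, where the indices
$\sigma$, $\tau$ take the values $\pm1$:

$$(T^N)_{\sigma\tau} = \sum_{\alpha,\gamma,\ldots=\pm1} T_{\sigma\alpha}\,T_{\alpha\gamma}\cdots T_{\omega\tau}. \tag{6.10}$$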






The trace of this $N$th power will be identical with the right side of (6.9) if we define
the transfer matrix to be

$$T_{\sigma\tau} \equiv \exp(-\beta J\sigma\tau). \tag{6.11}$$

Now (6.9) becomes simply

$$Z = {\rm Tr}\,T^N. \tag{6.12}$$

In this expression each of the matrix products involved in raising $T$ to the $N$th power
introduces one of the sums in (6.9) over the possible values of a spin.
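The identity between the configuration sum (6.9) and the trace (6.12) is easy to confirm numerically for a short ring. The following sketch assumes the Boltzmann factor $e^{-\beta J s_i s_{i+1}}$ per bond, as in (6.11); the function names and the parameter values are ours and purely illustrative.

```python
import itertools
import math

def z_brute(N, beta, J):
    """Sum (6.9) directly over all 2**N spin configurations of the ring (s_N = s_0)."""
    total = 0.0
    for s in itertools.product((-1, 1), repeat=N):
        bonds = sum(s[i] * s[(i + 1) % N] for i in range(N))
        total += math.exp(-beta * J * bonds)
    return total

def matmul2(A, B):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def z_transfer(N, beta, J):
    """Tr T^N with T_{sigma tau} = exp(-beta J sigma tau), sigma, tau = +1, -1."""
    spins = (1, -1)
    T = [[math.exp(-beta * J * a * b) for b in spins] for a in spins]
    P = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(N):
        P = matmul2(P, T)
    return P[0][0] + P[1][1]

N, beta, J = 10, 0.8, -1.0   # illustrative values only
print(z_brute(N, beta, J), z_transfer(N, beta, J))
```

The brute-force sum costs $2^N$ terms whereas the trace costs $N$ small matrix products, which is the practical content of the transfer-matrix method.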

In the limit of large $N$ the trace of $T^N$ is easily calculated. The trace is invariant
under orthogonal transformations of $T$, and since $T$ is real and symmetric it has
real eigenvalues, the larger of which is positive, and can be diagonalized by such a
transformation. In the frame in which $T$ is diagonal, $T^N$ is also diagonal, consisting of
$T$'s eigenvalues to the power $N$. Thus if we denote $T$'s eigenvalues in order of decreasing
magnitude by $\lambda_0$, $\lambda_1$, then

$$\lim_{N\to\infty}\frac1N\ln({\rm Tr}\,T^N) = \lim_{N\to\infty}\frac1N\ln\Big\{\lambda_0^N\big[1 + (\lambda_1/\lambda_0)^N\big]\Big\} = \ln\lambda_0, \tag{6.13}$$

where $\lambda_0$ is $T$'s larger eigenvalue, which is easily shown to be $2\cosh(\beta J)$. Thus for
large $N$ the ring's partition function and Helmholtz free energy per site are

$$Z = 2^N\cosh^N(\beta J) \quad\hbox{and}\quad f = -\frac1\beta\ln[2\cosh(\beta J)]. \tag{6.14}$$

The internal energy per site, $\partial(\beta f)/\partial\beta$, is

$$u = -J\tanh(\beta J), \tag{6.15}$$

which is a perfectly smooth function of $\beta$. So this system has no specific heat anomaly
such as that which usually occurs at the Curie point of a ferromagnetic material.
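As a numerical illustration of the large-$N$ limit and of the internal energy, the sketch below evaluates $(1/N)\ln{\rm Tr}\,T^N$ from the two eigenvalues of the $B = 0$ transfer matrix, $e^{-\beta J} \pm e^{\beta J}$, and differentiates $\beta f$ by a central difference. With $f = -\beta^{-1}\ln[2\cosh(\beta J)]$ the derivative evaluates to $-J\tanh(\beta J)$. The function names, step size and parameter values are our own choices.

```python
import math

def ln_Z_per_site(N, beta, J):
    """(1/N) ln Tr T^N for the ring with B = 0, from the two eigenvalues of T."""
    lam0 = 2.0 * math.cosh(beta * J)    # e^{-beta J} + e^{+beta J}
    lam1 = -2.0 * math.sinh(beta * J)   # e^{-beta J} - e^{+beta J}
    return math.log(lam0) + math.log1p((lam1 / lam0) ** N) / N

def f_per_site(beta, J):
    """Free energy per site in the limit N -> infinity."""
    return -math.log(2.0 * math.cosh(beta * J)) / beta

def u_per_site(beta, J, h=1e-6):
    """Internal energy per site, d(beta f)/d(beta), by central difference."""
    bf = lambda b: b * f_per_site(b, J)
    return (bf(beta + h) - bf(beta - h)) / (2.0 * h)

beta, J = 0.9, -1.0   # illustrative ferromagnetic coupling (J < 0)
for N in (4, 16, 64):
    print(N, ln_Z_per_site(N, beta, J), -beta * f_per_site(beta, J))
print(u_per_site(beta, J), -J * math.tanh(beta * J))
```

The finite-$N$ correction decays like $\tanh^N(\beta|J|)/N$, so the limiting value is reached rapidly unless $\beta|J|$ is large.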

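We can generalize the definition (6.11) of the transfer matrix to the case of non-zero
$B$:

$$T_{\sigma\tau} \equiv \exp\big[-\beta J\sigma\tau + \tfrac12\beta B(\sigma + \tau)\big] = \begin{pmatrix} e^{-\beta(J-B)} & e^{\beta J} \\ e^{\beta J} & e^{-\beta(J+B)} \end{pmatrix}. \tag{6.16}$$

$T$ is still symmetrical in its indices and we still have $Z = {\rm Tr}\,T^N$. The larger eigenvalue
of $T$ is

$$\lambda_0 = e^{-\beta J}\Big[\cosh\beta B + \sqrt{\cosh^2\beta B + (e^{4\beta J} - 1)}\Big], \tag{6.17}$$

so in the presence of a magnetic field the free energy per site is

$$f = J - \frac1\beta\ln\Big[\cosh\beta B + \sqrt{\cosh^2\beta B + (e^{4\beta J} - 1)}\Big]. \tag{6.18}$$

Differentiating with respect to $B$ we obtain the average of $s$:

$$\langle s\rangle = -\frac{\partial f}{\partial B} = \frac{\sinh\beta B}{\sqrt{\cosh^2\beta B + (e^{4\beta J} - 1)}}. \tag{6.19}$$

No matter what the value of $\beta$ or the sign of $J$, $\langle s\rangle \to 0$ as $B \to 0$. Thus the system
never becomes a ferromagnet. Evidently this simply solved system does not exhibit a
phase change.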
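The closed forms for the larger eigenvalue and the mean spin can be cross-checked numerically. The sketch below assumes the transfer matrix $T_{\sigma\tau} = \exp[-\beta J\sigma\tau + \tfrac12\beta B(\sigma+\tau)]$, computes $\langle s\rangle$ both from the closed form and from a central-difference derivative $-\partial f/\partial B$, and shows how the magnetization falls to zero with the field even for strong ferromagnetic coupling. Names, step size and parameter values are illustrative choices of ours.

```python
import math

def lambda0(beta, J, B):
    """Larger eigenvalue of the 2x2 transfer matrix in a field B."""
    c = math.cosh(beta * B)
    return math.exp(-beta * J) * (c + math.sqrt(c * c + math.exp(4.0 * beta * J) - 1.0))

def f_per_site(beta, J, B):
    """Free energy per site, -(1/beta) ln(lambda0)."""
    return -math.log(lambda0(beta, J, B)) / beta

def mean_spin(beta, J, B):
    """Closed form for <s> = -df/dB."""
    c = math.cosh(beta * B)
    return math.sinh(beta * B) / math.sqrt(c * c + math.exp(4.0 * beta * J) - 1.0)

def mean_spin_numeric(beta, J, B, h=1e-6):
    """Central-difference estimate of -df/dB."""
    return -(f_per_site(beta, J, B + h) - f_per_site(beta, J, B - h)) / (2.0 * h)

beta, J = 1.2, -1.0   # illustrative values, J < 0 favours alignment
for B in (0.3, 0.03, 0.003, 0.0):
    print(B, mean_spin(beta, J, B))
print(mean_spin(beta, J, 0.3), mean_spin_numeric(beta, J, 0.3))
```

Lowering the temperature steepens the response to a small field, but at any finite $\beta$ the curve still passes through the origin.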

It is not difficult to understand why the Ising ring never makes the transition to a
ferromagnetic state. Suppose that it did and that we examined the system with $J < 0$
at a very low temperature, when the great majority of the spins would be aligned. We
consider how the entropy of the Universe would change if the spins in some section of
the ring decided to flip over. The ring's energy would usually go up by $-4J$, so the
entropy of the rest of the Universe, whence the energy came, would drop by $-4J/T$.
But the beginning and end of the flipped section can be placed in $N^2$ different places.
So the entropy of the ring goes up by $2k_{\rm B}\ln N$. Hence for sufficiently large $N$, flipping
a section of the ring, even a large section, will always produce a net entropy gain no
matter how low the ring's temperature.
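The balance in this argument can be made quantitative. In units of $k_{\rm B}$ the net change is $2\ln N + 4\beta J$ (recall $J < 0$), which is positive once $N > e^{-2\beta J}$. The sketch below simply evaluates these two expressions; the function names and the sample values are ours.

```python
import math

def net_entropy_change(N, beta, J):
    """Total entropy change / k_B when one section of a ring of N spins flips (J < 0)."""
    ring_gain = 2.0 * math.log(N)   # ln(N**2): positions of the two ends of the section
    bath_loss = -4.0 * beta * J     # energy -4J taken from surroundings at temperature T
    return ring_gain - bath_loss

def min_ring_size(beta, J):
    """Ring size above which flipping a section increases the total entropy."""
    return math.exp(-2.0 * beta * J)

beta, J = 5.0, -1.0   # illustrative: k_B T is one fifth of |J|
print(min_ring_size(beta, J))
for N in (10**3, 10**4, 10**5):
    print(N, net_entropy_change(N, beta, J))
```

The threshold grows exponentially as the temperature falls, but it is finite at every non-zero temperature, so a sufficiently long ring always disorders.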

In 1944 Onsager showed that the d = 2 Ising model does spontaneously magnetize. 
It is instructive to see whether an extension of the argument we have just given enables 
us to anticipate this result. The energy required to flip a section of the Ising ring is 
independent of the length of the section. In higher dimensions the 'surface energy' of 
a flipped block of spins increases with the number of spins in the block. The minimum 
energy for a block containing a given number M of spins is achieved when the block is 
a $d$-sphere: if $a$ is the lattice spacing and $r$ the radius of the block, it is for $d = 2$

$$-2J\times\frac{2\pi r}{a} = -2J(4\pi M)^{1/2}.$$

For $d = 3$ the corresponding number is $-2J(36\pi M^2)^{1/3}$: the energetic cost of flipping
a block rises more sharply with the block's size the bigger $d$ is. The gain in the system's
entropy is harder to calculate. The centre of the block can be located at any of $N$ sites,
which gives us a straight entropy gain $k_{\rm B}\ln N$. If the block had to be spherical, we
could immediately conclude that flipping blocks with $M$ comparable with $N$ must lead
to a net entropy decrease since the entropy cost of the energy, which is $\propto \beta M^p$ with
$p = \tfrac12$ or $\tfrac23$, can be offset by an entropy gain $\propto \ln N$ only for $M \ll N$. However, the
block doesn't need to be spherical, so we should consider the number of ways $n_{\rm wrap}$
in which a boundary of given area can be wrapped around the centre. This is a hard
problem, but one can see that $n_{\rm wrap}$ must increase very rapidly with $M$ if $\ln(n_{\rm wrap})$
is going to overwhelm $\beta M^p$ and prevent the system settling to an ordered state at
sufficiently low temperature. Hence it is not a priori unlikely that the $d = 2$ Ising
model can spontaneously magnetize.

Figure 4 Graphical solution of the mean-field equation (7.3) for the Ising model.



7 Mean-field theory 

Consider the $d$-dimensional Ising model in a uniform external magnetic field $B$. We
write down the thermal average of a certain spin $s_i$ given that the spins of its neighbours
are constrained to take particular values:

$$\langle s_i\rangle = \frac{\exp\big[-\beta\big(\sum_j J_{ij}s_j - B\big)\big] - \exp\big[\beta\big(\sum_j J_{ij}s_j - B\big)\big]}{\exp\big[-\beta\big(\sum_j J_{ij}s_j - B\big)\big] + \exp\big[\beta\big(\sum_j J_{ij}s_j - B\big)\big]}. \tag{7.1}$$

Here, $J_{ij}$ arises rather than $\tfrac12 J_{ij}$ because each pair of sites is counted twice in the
sum over $i$ and $j$ of (6.1). To get the true value of $\langle s_i\rangle$ we must now average over
all the values that the neighbouring spins $\{s_j\}$ can take. The correct weights of these
spin configurations involve the interaction of the neighbouring spins with their
neighbours, and so on. In order to break into this ladder of successively more complicated
equations for the thermal averages of more and more distant spins, we make a simple
approximation. We see that the quantity $B - \sum_j J_{ij}s_j$ acts in (7.1) just like an
effective magnetic field at site $i$; our approximation will be to replace this field by its mean
value $B - \sum_j J_{ij}\langle s_j\rangle$. This is the origin of the name mean-field theory. Equation (7.1)
then becomes

$$\langle s_i\rangle = \tanh\Big[\beta\Big(B - \sum_j J_{ij}\langle s_j\rangle\Big)\Big]. \tag{7.2}$$



We assume that $J_{ij}$ is given by (6.2) with $J < 0$, so that the model has a tendency
for spins to align, and put $\epsilon = -J$. Since the system is translationally invariant, we
do not expect the average $\langle s_i\rangle$ of a spin to vary from site to site; it will be everywhere
equal to the specific magnetization $m$. Equation (7.2) now becomes

$$m = \tanh[\beta(z\epsilon m + B)], \tag{7.3}$$

where $z$ is the number of nearest neighbours (the coordination number).
A graphical solution of (7.3) with $B = 0$ is shown in Fig. 4. For small $z\beta\epsilon$, (7.3)
has no non-zero solution for $m$. However, as $z\beta\epsilon$ increases, a solution with non-zero $m$
appears. The critical value $\beta_c$ of $\beta$ at which a non-zero average value of the total spin
becomes possible is given by

$$z\beta_c\epsilon = 1. \tag{7.4}$$

Equation (7.4) predicts that $\beta_c$ should depend on the geometry of the model only
through the coordination number $z$ and that it is finite for all $z \ne 0$. We know, however,
that spatial dimensionality is crucial in determining the behaviour of the model. In
particular, we saw that the one-dimensional Ising model has no ordered phase. In two
and three dimensions mean-field theory is more successful; equation (7.4) correctly
predicts that there will be a transition to non-zero magnetization, and the mean-field
values for the transition temperatures on square and cubic lattices ($\beta_c\epsilon = 1/z = 0.25$ for
$d = 2$ and $0.167$ for $d = 3$) are at least of the same order of magnitude as the true
values, $\beta_c\epsilon = 0.4407$ for $d = 2$ from Onsager's work and $\beta_c\epsilon = 0.222$ for $d = 3$ from
numerical simulations. In every case mean-field theory has underestimated $\beta_c$ and
therefore overestimated $T_c$. This is a general property of mean-field theory.
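The self-consistency condition $m = \tanh[\beta(z\epsilon m + B)]$ is readily solved by iteration, which mimics the graphical construction. The sketch below is a minimal illustration, assuming simple fixed-point iteration started from $m = 1$; the function names, tolerances and parameter values are ours. Convergence becomes slow very close to $z\beta\epsilon = 1$.

```python
import math

def mean_field_m(beta, z, eps, B=0.0, m0=1.0, tol=1e-12, max_iter=100000):
    """Solve m = tanh[beta (z eps m + B)] by fixed-point iteration from m0."""
    m = m0
    for _ in range(max_iter):
        m_new = math.tanh(beta * (z * eps * m + B))
        if abs(m_new - m) < tol:
            return m_new
        m = m_new
    return m  # not converged to tol; expected only very close to beta_c

def beta_c(z, eps):
    """Mean-field critical inverse temperature, from z beta_c eps = 1."""
    return 1.0 / (z * eps)

z, eps = 4, 1.0   # square lattice, energies in units of eps
print(beta_c(z, eps), beta_c(6, eps))
for beta in (0.20, 0.30, 0.375):
    print(beta, mean_field_m(beta, z, eps))
```

For $\beta < \beta_c$ the iteration collapses to $m = 0$, while for $\beta > \beta_c$ it settles on the non-zero root.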

7.1 Mean-field theory of the non-ideal gas

Let's apply mean-field theory to a non-degenerate non-ideal gas. Since the gas is
non-degenerate, we can work in the classical picture, in which the gas has $2N$ vector
phase-space coordinates, namely the positions ${\bf x}_i$ and momenta ${\bf p}_i$ of the $i = 1,\ldots,N$
molecules. The molecules interact with each other through some potential $\Phi(\{{\bf x}_i\})$, so
the partition function is

$$Z = \int d^3{\bf p}_1\,d^3{\bf x}_1 \ldots d^3{\bf p}_N\,d^3{\bf x}_N \exp\Big\{-\beta\Big[\sum_i \frac{p_i^2}{2m} + \Phi(\{{\bf x}_i\})\Big]\Big\}. \tag{7.5}$$

The mean-field theory of this gas consists in supposing that the effect on each particle
of all the others can be represented by an effective single-particle potential $\phi({\bf x})$ in
which all the molecules may be assumed to move. The total potential energy then
decomposes into a sum of terms, one for each particle:

$$\Phi(\{{\bf x}_i\}) \to \sum_i \phi({\bf x}_i). \tag{7.6}$$

The integral in (7.5) now factorizes and we obtain

$$Z = \Big[\int d^3{\bf p}\,d^3{\bf x}\,\exp\big\{-\beta\big[p^2/2m + \phi({\bf x})\big]\big\}\Big]^N. \tag{7.7}$$

The momentum and position integrals now separate. The momentum integral will
contribute a volume-independent term to the free energy and we can therefore neglect
it for the purpose of calculating the pressure exerted by the gas.

The optimum form of the function $\phi({\bf x})$ will depend on the original potential $\Phi$.
For the purposes of the present calculation, however, let us assume a particular simple
form for it; let us suppose that it excludes each molecule from some volume $V_{\rm ex}$ of space
by means of an infinite potential barrier (in order to model the 'hard-core repulsion'
of the other molecules) and takes some finite value $u$ elsewhere (which will be negative
because of the long-range attractive forces acting between molecules). Therefore, to
within a volume-independent factor coming from the momentum integration in (7.7),

$$Z = \big[(V - V_{\rm ex})\,e^{-\beta u}\big]^N. \tag{7.8}$$

The free energy is then

$$F = -Nk_{\rm B}T\big[\ln(V - V_{\rm ex}) - \beta u\big]. \tag{7.9}$$

The pressure may be found by differentiating with respect to volume at constant
temperature:

$$P = -\Big(\frac{\partial F}{\partial V}\Big)_T = \frac{Nk_{\rm B}T}{V - V_{\rm ex}} - N\Big(\frac{\partial u}{\partial V}\Big)_T. \tag{7.10}$$

Recalling that $u$ was supposed to originate from the attraction of a given molecule for all
the others outside the excluded-volume region, it is clear that $u$ should be proportional
to the density of molecules in the gas, $u \propto N/V$. Presumably the excluded volume
itself will be proportional to $N$. If we choose the constants of proportionality so that

$$u = -\frac{a}{N_{\rm A}^2}\frac{N}{V} \quad\hbox{and}\quad V_{\rm ex} = \frac{b}{N_{\rm A}}N, \tag{7.11}$$

where $N_{\rm A}$ is Avogadro's constant, then (7.10) becomes

$$P = \frac{Nk_{\rm B}T}{V - bn} - \frac{an^2}{V^2}, \tag{7.12}$$

where $n = N/N_{\rm A}$ is the number of moles of gas in our sample. Equation (7.12) is
nothing but the van der Waals equation of state.
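The step from the free energy to the equation of state can be verified numerically. The sketch below assumes the forms $u = -aN/(N_{\rm A}^2V)$ and $V_{\rm ex} = bN/N_{\rm A}$, with $V_{\rm ex}$ held fixed when differentiating, and compares a central-difference estimate of $-(\partial F/\partial V)_T$ with $Nk_{\rm B}T/(V - bn) - an^2/V^2$. The values of $a$ and $b$ are merely illustrative numbers of the size appropriate to a common gas in SI units, and the function names are ours.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol

def free_energy(V, T, N, a, b):
    """F = -N k_B T [ln(V - V_ex) - beta u], dropping volume-independent terms."""
    u = -a * N / (N_A ** 2 * V)   # mean-field potential energy per molecule
    V_ex = b * N / N_A            # excluded volume
    return -N * K_B * T * math.log(V - V_ex) + N * u

def pressure_numeric(V, T, N, a, b, h=1e-8):
    """P = -(dF/dV)_T by central difference."""
    return -(free_energy(V + h, T, N, a, b) - free_energy(V - h, T, N, a, b)) / (2.0 * h)

def pressure_vdw(V, T, N, a, b):
    """van der Waals pressure, with n = N / N_A moles."""
    n = N / N_A
    return N * K_B * T / (V - b * n) - a * n ** 2 / V ** 2

# Illustrative values: one mole in one litre at 300 K.
V, T, N, a, b = 1.0e-3, 300.0, N_A, 0.364, 4.27e-5
print(pressure_numeric(V, T, N, a, b), pressure_vdw(V, T, N, a, b))
```

Setting $a = b = 0$ in the same functions recovers the ideal-gas pressure $Nk_{\rm B}T/V$.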

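Appendix A Solving (2.6)

Let $s(n) \equiv s_n$. Then eq. (2.6) is easily extended to

$$s(mnr\cdots) = s(n) + s(m) + s(r) + \cdots, \tag{A.1}$$

so with $n = m = r = \cdots$ we conclude that

$$s(n^k) = k\,s(n). \tag{A.2}$$

Now let $u, v$ be any two integers bigger than 1. Then for arbitrarily large $n$ we can
find $m$ such that

$$u^m \le v^n < u^{m+1}, \quad\hbox{that is}\quad \frac{m}{n} \le \frac{\ln v}{\ln u} < \frac{m+1}{n}. \tag{A.3}$$

Since $s$ is monotone increasing, (A.2) then gives

$$m\,s(u) \le n\,s(v) < (m+1)\,s(u), \quad\hbox{that is}\quad \frac{m}{n} \le \frac{s(v)}{s(u)} < \frac{m+1}{n}. \tag{A.4}$$

Comparing (A.3) with (A.4) we see that

$$\Big|\frac{s(v)}{s(u)} - \frac{\ln v}{\ln u}\Big| < \frac1n \quad\Rightarrow\quad \Big|\frac{s(v)}{\ln v} - \frac{s(u)}{\ln u}\Big| < \epsilon, \tag{A.5}$$

where $\epsilon = s(u)/(n\ln v)$ is arbitrarily small. Thus we have shown that $s(v) \propto \ln v$.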
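The bracketing step can be illustrated with exact integer arithmetic: for given integers $u, v > 1$ and a chosen $n$, we look for the largest $m$ with $u^m \le v^n$, and then $m/n$ approximates $\ln v/\ln u$ to within $1/n$. The sketch below is only an illustration of this step; the function name and the sample values are ours.

```python
import math

def bracket(u, v, n):
    """Largest integer m with u**m <= v**n, found with exact integer arithmetic."""
    target = v ** n
    m = 0
    power = u            # holds u**(m + 1)
    while power <= target:
        m += 1
        power *= u
    return m

u, v = 2, 3
for n in (10, 100, 1000):
    m = bracket(u, v, n)
    ratio = math.log(v) / math.log(u)
    # m/n <= ln v / ln u < (m + 1)/n, so the gap is below 1/n
    print(n, m, m / n, ratio, ratio - m / n)
```

As $n$ grows the gap shrinks like $1/n$, which is what forces $s(v)/s(u)$ to equal $\ln v/\ln u$.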
Appendix B The method of steepest descent

Suppose one wishes to evaluate a contour integral of the form

$$I = \int_\gamma dz\,e^{g(z)}, \tag{B.1}$$

where $\gamma$ is some contour in the complex plane and $g(z)$ is an analytic function of $z$.
Then one can obtain an approximate value for $I$ as follows.

The modulus of the integrand is largest at the point $z_s$ at which $u(z) \equiv \Re e(g)$
peaks. At this point $\partial u/\partial x = \partial u/\partial y = 0$, where $z = x + iy$, so by the Cauchy-Riemann
conditions, at $z_s$, $v(z) \equiv \Im m(g)$ also has vanishing derivatives with respect to $x$ and $y$.
So at $z_s$, $dg/dz = 0$. Consequently, $g$'s Taylor series about $z_s$ reads:

$$g(z) = g(z_s) + \tfrac12 g''(z_s)(z - z_s)^2 + \cdots. \tag{B.2}$$

We write $g''(z_s) = -2a\,e^{2i\phi}$ and $z - z_s = \epsilon\,e^{-i\phi'}$, where $a > 0$, $\epsilon$, $\phi$ and $\phi'$ are all real
numbers. Then

$$g(z) \simeq g(z_s) - a\epsilon^2 e^{2i(\phi - \phi')}. \tag{B.3}$$

Now we deform the original contour of integration $\gamma$ until it passes through $z_s$ along
the line $\phi' = \phi$. The contribution to the integral $I$ from points near $z_s$ can now be
written

$$I' = e^{-i\phi}e^{g(z_s)}\int_{{\rm small}\ \epsilon} d\epsilon\,e^{-a\epsilon^2}. \tag{B.4}$$

Although the Taylor expansion on which this expression is based is valid only for
sufficiently small $\epsilon$, the Gaussian nature of the integrand of (B.4) allows us to extend the
limits of integration to $\pm\infty$ without appreciable error. Thus from the usual Gaussian
integral we have

$$I' \simeq e^{-i\phi}e^{g(z_s)}\sqrt{\pi/a}. \tag{B.5}$$

The method of steepest descent consists in equating $I$ with the contribution $I'$ from
the neighbourhood of $z_s$:

$$\int_\gamma dz\,e^{g(z)} \simeq e^{-i\phi}e^{g(z_s)}\sqrt{\frac{2\pi}{|g''(z_s)|}}, \quad\hbox{where}\quad \phi = \tfrac12\arg[-g''(z_s)]. \tag{B.6}$$

If we deform our contour so that it passes through the stationary point of $g$ along
the line $\phi' = \phi \pm \pi/2$, the integrand has a minimum at the stationary point because it is
then proportional to $e^{+a\epsilon^2}$. Thus the stationary point is a saddle point of the integrand.
When we used the method of steepest descent to evaluate $Z_N$ for an ideal gas, we
located the stationary point as a minimum of the integrand along the real axis, and
chose a contour that crosses the stationary point at right angles to the real axis, thus
perceiving it as a maximum.
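A familiar test case shows how accurate the saddle-point estimate $e^{g(z_s)}\sqrt{2\pi/|g''(z_s)|}$ can be. We take, purely as an illustration and not as the integral used for $Z_N$ in the text, $N! = \int_0^\infty dx\,e^{g(x)}$ with $g(x) = N\ln x - x$. Here the stationary point $x_s = N$ lies on the real axis and $g''(x_s) = -1/N$ is real and negative, so the phase $\phi$ vanishes and the estimate reduces to Stirling's formula. The function names are ours.

```python
import math

def log_saddle_estimate(g_s, g2_s):
    """ln of e^{g(z_s)} sqrt(2 pi / |g''(z_s)|), for real negative g''(z_s) (phase phi = 0)."""
    return g_s + 0.5 * math.log(2.0 * math.pi / abs(g2_s))

def log_factorial_saddle(N):
    """Saddle-point estimate of ln N! from g(x) = N ln x - x."""
    x_s = float(N)                    # stationary point: g'(x) = N/x - 1 = 0
    g_s = N * math.log(x_s) - x_s
    g2_s = -N / x_s ** 2              # g''(x_s) = -1/N
    return log_saddle_estimate(g_s, g2_s)

for N in (5, 50, 500):
    exact = math.lgamma(N + 1)        # ln N!
    print(N, exact, log_factorial_saddle(N), exact - log_factorial_saddle(N))
```

The error in $\ln N!$ is close to $1/(12N)$, so the relative accuracy of the method improves as the peak of the integrand becomes sharper.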



